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Abstract 

We study a class of Monte Carlo algorithms for the nonlinear o"- model, based 
on a Wolff-type embedding of Ising spins into the target manifold M. We argue 
heuristically that, at least for an asymptotically free model, such an algorithm 
can have dynamic critical exponent z <C 2 only if the embedding is based on an 
(involutive) isometry of M whose fixed-point manifold has codimension 1. Such 
an isometry exists only if the manifold is a discrete quotient of a product of 
spheres. Numerical simulations of the idealized codimension-2 algorithm for the 
two-dimensional 0(4)-symmetric a-model yield -2j„t^;v(2 = 1.5 it 0.5 (subjective 
68% confidence interval), in agreement with our heuristic argument. 



* Submitted to Nuclear Physics B. 



1 Introduction 



Wolff HI, ^, 1^ has recently proposed an extraordinarily efficient collective-mode Monte 
Carlo algorithm for simulating the nonlinear cr-model taking values in the sphere 
S^~^ and having symmetry group 0{N) [also called the A^- vector model]. Numerical 
tests of the dynamic critical behavior of this algorithm show the complete or almost 
complete absence of critical slowing-down (2; < 0.1) for two-dimensional models with 

= 2, 3, 4 H I, I, I, I, and a small but apparently nonzero critical slowing-down 
{z ^ 0.25 — 0.5) for the three-dimensional XY model 

In this paper0 

we consider generalizations of Wolff's algorithm to cr-models taking values in man- 
ifolds other than spheres. (We see this as a first step toward generalizing the Wolff 
algorithm to lattice gauge theories.) Our conclusion is somewhat surprising: we ar- 
gue that the generalized Wolff algorithm can work well (i.e. have z <^ 2) only if the 
manifold is a discrete quotient of a product of spheres (e.g. real projective space). 
This conclusion is based on a combination of a heuristic argument and a rigorous 
mathematical theorem; it is supported by a numerical test in one prototypical case. 

Wolff's algorithm for the A^-vector model is based on an embedding of Ising spins 
{e} into A^-component continuous spins {cr} according to 

= (7^ + e^\(Tl\r , (1.1) 

where r is a unit vector chosen randomly on S'^~^, cr^ = cr^ — (cTx ■ r)r and cr^ = 
(cTj. ■ r)r are the components of cr^ perpendicular and parallel to r, and Ex = sgn(cra. ■ 
r) = ±1. Flipping the Ising spin Ex corresponds to a reflection of cTx in the hyperplane 
perpendicular to r. With {cr-*-} and {|crll|} held fixed, the cx-model Hamiltonian 

^(M) = -Pj:^'^-^y (1-2) 

reduces to the ferromagnetic random-bond Ising model (in zero magnetic field) defined 
by 

= - 51 JxySxSy + COUSt , (1.3) 

i^y) 

where Jxy = I3\crx ■ ^\Wy ■ r|. The Ising model (|1.3| ) can then be simulated by any 
legitimate Monte Carlo algorithm, such as the the Swendsen-Wang (SW) algorithm 
0, |10| or its single-cluster variant [jl], |lT], |12|, |13 . 



The dynamic critical behavior of Wolff-type algorithms is determined by the com- 
bined effect of two completely distinct issues: 

i) How well the embedding ( |1.1D succeeds in "encoding" the important large-scale 
collective modes of the cr-model into the Ising variables {e}. 



^But, regarding [|| ^, see our discussion in Section || below. 

preliminary version of this work was reported at the Lattice '90 conference 
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ii) How well any given algorithm for Ising models (e.g. standard SW or single- 
cluster SW) succeeds in updating the spins {e} governed by the (random) 
Hamiltonian ( |1.3| ). 

We wish to emphasize the importance of studying these questions separately. If the 
physically relevant large-scale collective motions of the cr-model cannot be obtained 
by varying the {e} at fixed {cr-*-, |crll|}, then the Wolff algorithm will have severe 
critical slowing-down no matter what method is used to update the Ising variables 
{e}. On the other hand, if the Wolff algorithm with a particular choice of Ising- 
updating method shows severe critical slowing-down, this does not necessarily mean 
that the embedding ( p. . 1| ) works badly: the poor performance might be due to slow 
decorrelation in the Ising-updating subroutine, and could possibly be remedied by 
switching to a better Ising algorithm. 

In this paper our goal is to study the embedding defined by ( |1.1|) or its generaliza- 
tions, independently of the question of how the induced Ising model is to be updated. 
To address this question, it is conceptually useful to consider the idealized Wolff al- 
gorithm, which for the A^-vector model goes as follows: a vector r is chosen randomly 
from the unit sphere S'^~^; and a new configuration of Ising spins {e}, independent of 
the old configuration, is generated with probabilities given by the Hamiltonian (|1.3|) . 
Of course, such an algorithm is not practical, but that is not its role. Rather, it serves 
as a standard of comparison (and presumed lower bound on the autocorrelation time) 
for all algorithms based on the embedding ( |1 . 1| ) . If the idealized Wolff algorithm per- 
forms badly, then so must any algorithm based on the given embedding. On the other 
hand, if the idealized Wolff algorithm performs well, then it is clearly worthwhile to 
seek (if necessary) new Ising-model algorithms capable of simulating efficiently the 
induced Ising Hamiltonian. 

To approximate in practice the idealized Wolff algorithm, we update the {e} 
configuration by Nhu hits of some chosen Ising-model algorithm (e.g. standard SW) 
and extrapolate to = oo.0 To be sure, this test procedure can be very time- 
consuming. But it is essential if we wish to obtain physical insight into the embedding. 

The extraordinary performance of the Wolff algorithm for A^-vector models has 
spurred attempts (so far unsuccessful) to generalize it to lattice gauge theories. Gauge 
theories differ from A^-vector models in two ways: 

a) The field takes values in a group rather than a sphere. [U{1) and SU{2) are 
spheres, but higher Lie groups are not.] 

b) The field is a 1-form rather than a 0-form, i.e. it lives on links rather than sites. 
Correspondingly, the energy is the curl of the field rather than its gradient, and 
it lives on plaquettes rather than links. As a result, the theory has a local gauge 
invariance rather than just a global symmetry. 

The deep physical difference between gauge and spin models is, of course, item (b). 
The fact of gauge invariance, and the transverseness of physical excitations in a gauge 

''Preferably we would perform simulations for successively increasing values of Nhu until the 
autocorrelation time is constant within error bars. However, this may not always be feasible. 
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theory, will impose severe constraints, we believe, on the as-yet-unknown analogue of 
the embedding (|1.1| ) [if indeed such an analogue exists] .0 At present we have little 
to say in this direction (though some insight might possibly be gleaned from the 
Swendsen-Wang algorithm for Potts lattice gauge theories [0, |1^). In this paper we 
address the less profound, but still highly nontrivial, problem (a). To do this, we ask 
whether the embedding can be generalized to nonlinear a-models with values in 
manifolds other than spheres — such as SU{N) for > 3 — and, if so, what is the 
dynamic critical behavior of the corresponding idealized Wolff algorithm. 

Our approach is as follows: First we ask what are the fundamental properties of 
the embedding ( |1 . 1|) that cause the Wolff algorithm to work so well. Then we ask 
whether embeddings having these properties exist also in other Riemannian manifolds 
M; this is a question in differential geometry to which we are able to give a fairly 
complete answer. Finally, we perform a numerical study to test (in one case) whether 
our theoretical reasoning is correct. The conclusion of this analysis is quite surprising: 
roughly speaking, we find that a generalized Wolff algorithm can work well (i.e. have 
2; -C 2) only if the manifold M is either a sphere, a product of spheres, or the quotient 
of such a space by a discrete group (for example, real projective space RP^~^). If 
correct, this conclusion is quite disappointing, and lends renewed impetus to other 



classes of collective-mode algorithms such as multi-grid Monte Carlo [|T^ |T8|, |19 
and Fourier acceleration |^0|, ^ ^ 



The plan of this paper is as follows: In Section ^ we define generalized Wolff- 
type embedding algorithms for a-models taking values in a Riemannian manifold 
M. We argue heuristically that such an algorithm will work well — at least for 
asymptotically free cx-models — only if the embedding is based on an (involutive) 
isometry of M whose fixed-point manifold has codimension 1. This argument is the 
key physical idea of this paper. In Section | we study the conditions under which 
such an isometry can exist. In Section ^ we test our heuristic argument with a large- 
scale numerical study of the codimension-2 idealized embedding algorithm for the 
two-dimensional A^-vector model with A^ = 4. In Section ^ we discuss our results. 
Appendix |^ collects some results from topology and differential geometry that are 
essential to our argument in Sections || and |^. Some of these results are difficult to 
find in the mathematical literature, and at least a few appear to be new. Appendix 
P contains a complete classification of the involutive isometrics for the physically 
important case of SU{N). Much to our surprise, we have been unable to find this 
classification anywhere in the mathematical or physical literature. 

2 Generalized Embedding Algorithms 

2.1 Wolff algorithm for the A^^- vector model 

The single-spin configuration space of the A^-vector model is the sphere 

5^-1 = {(re R^: \(t\ = 1} , (2.1) 

^The same issue arises in devising multi-grid algorithms for gauge theories Section V]. 
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and the key object of the Wolff algorithm is the map of reflection in the equator: 

= (2.2) 

(This corresponds to r = (1, 0, . . . , 0); obviously the case of arbitrary r can be obtained 
by conjugating with a rotation.) The map T has two key properties: 

a) T is an isometry (i.e. it preserves distances). 

b) The fixed-point set Fix(T) = {cr G S^~'^: Tcr = cr} — which is of course the 
equator — has codimension 1. 

We argue that these are the crucial properties that make the Wolff algorithm work 
so well. 

Because T is an isometry, a global application of T is an exact symmetry (zero 
change in energy), and the application of T in a large but bounded region costs only 
a surface energy. (If T were not an isometry, then the application of T in a region 
would cost a bulk energy, and the idealized Wolff update would probably be unable 
to make any significant large-scale change in the {cr} configuration.) 

To see the importance of the codimension- 1 property, let us review our heuris- 
tic understanding |Q of why the Wolff algorithm works so well. Consider a slowly 
varying spin configuration {cr}. Since Jxy = (3\crx ■ ^\Wy ■ r|; the Hamiltonian (|1.3| ) 
almost decouples along the surfaces where cr ■ r 0, i.e. where cr is on or near the 
equator. These surfaces (= codimension-1 submanifolds) divide x-space into discon- 
nected regions Ri in which cr-r > Oorcr-r < 0. Provided that we are working 
in an non-magnetized phase (i.e. there is no preferred orientation for the spins), the 
largest of the regions Ri will presumably have linear size on the order of the correla- 
tion length ^. In particular, ifl-C^<L/2 (here L is the lattice linear size), then 
there will be, with high probability, at least two disjoint large regions R^. Now let 
us apply an an idealized Wolff update: the spins {e} will be given new values, and 
these values will be almost independent in distinct regions Ri. That is, the choice to 
reflect or not reflect a spin cr will have been made almost independently in distinct 
regions Ri (but very coherently within each Ri). If the original configuration {cr} is 
a long- wavelength spin wave, then these almost-independent reflections correspond 
roughly to a long-wavelength collective-mode change in {cr} (Figure p. We predict, 
therefore, that the idealized Wolff algorithm will have dynamic critical exponent z 
much less than 2, and quite possibly z ~ 0. This prediction has been confirmed 
numerically in the two-dimensional A^- vector models with iV = 2,3,4 |^, ^ ^, 
and in the three-dimensional model with N = 2 [^0.0 

The assumption in this argument that {cr} is slowly varying requires some clarifi- 
cation.!^ Consider first the case in which the critical point is at zero temperature {(3c = 
oo); this case corresponds to asymptotically free theories such as the two-dimensional 

^Regarding j^, 0, see our discussion in Section || below. 

® We did not fully understand these subtleties when we wrote (Quite possibly we still 

don't.) 
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Figure 1: Action of the Wolff algorithm on a long- wavelength spin wave. For 
simplicity, both spin space (cr) and physical space {x) are depicted as one-dimensional. 



A^- vector models for N > 3. For such theories, the typical spin configurations become 
smooth as P ^ Pc — oo: more precisely, for a nearest-neighbor pair of sites x,y, we 
have {(Tx ■ tTy) ~ 1 — c/ (3 for a (calculable) constant c. That is, the typical angle 
between nearest-neighbor spins is of order 1/^//?; so on bonds {xy) where cr^,- and CTy 
are near the equator, we typically have J^y = I3\(Tx ■ rllcTy ■ i"| ~ ~ 1- Note 

that this behavior occurs not only on the bonds {xy) where cr crosses the equator 
[i.e. for which [cTx ■ r){ay • r) < 0], but also on the nearby bonds where cr remains 
near the equator. On the other hand, on bonds (xy) where cr^ and ay are far from 
the equator (i.e. bonds deep inside one of the regions Ri), we typically have Jxy ~ f3. 
Thus, the couplings {Jxy} of the induced Ising model have typically the following 
structure: there are regions Ri of linear size ~ ^ within which the couplings are very 
strong (of order (3), surrounded by transition layers (of width probably ~ 1) where 
the couphngs are of moderate strength (of order 1). 

Now, from this information alone we cannot determine the phase structure of the 
induced Ising model. For example, if the couplings {Jxy} in the transition layers 
— which are "of order 1" — were typically greater than the critical coupling Jc,ising 
for a d-dimensional translation-invariant Ising ferromagnet, then the induced Ising 
model would surely exhibit long-range order. (The strongly ferromagnetic couplings 
Jxy ~ /5 within the regions Ri would only strengthen this long-range order.) On the 
other hand, if the transition-layer couplings were sufficiently smaller than Jc,ising, then 
the induced Ising model could exhibit exponentially-decaying correlations in spite of 
the large couplings Jxy ~ /3 within the regions Ri. The distinction between these 
two scenarios seems to depend on the precise values of the transition-layer couplings 
{Jxy}, as well as on the widths of the transition layers — quantities which our order- 
of-magnitude argument is too crude to predict. 

On the other hand, we know that the original iV-vector model is in a phase with 
exponentially decaying correlations; in particular, we expect that 

(sgn(o-, • r) sgn(o-^ • r)) - e-l"-^l/« (2.3) 

as |x — y| — > oo. [This is not the usual correlation function {cTx ■ CTy), but it should 
exhibit the same correlation length.] On the other hand, the unconditional expecta- 
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tion ( p.3|) is equal to the average over {cr , |crll|} (with Boltzmann-Gibbs weight) of 
the conditional expectations 

(sgn(<T^ ■ r) sgn{ay ■ r) | {(t^, |crl'|}) = {£x£y){j^y} ■ (2.4) 



Now, the conditional expectations ( p.4|) are always nonnegative, since the induced 
Ising model is purely ferromagnetic. So there is no possibility of "cancellations" in 
forming (|2.3|) from (O). We conclude that the induced-Ising-model correlations (|2.4| ) 



should also exhibit exponential (or faster-than-exponential) decay, on a characteristic 
scale ^ (or smaller), for "nearly all" configurations {cr-*-, |crll|}. 

This exponential decay is clearly not due to the large couplings J^y ~ P within the 
regions it must be due to the moderate couplings J^y ~ 1 in the transition layers. 
In other words, the transition layers do succeed in (almost-) decoupling neighboring 
(or at least almost-neighboring) regions This reasoning justifies, in the case of 
an asymptotically free model, our proposed mechanism for the good dynamic critical 
behavior {z ^ 0) of the idealized Wolff algorithm. 

Next consider the case of a critical point at finite temperature {(3c < oo), as in 
the two-dimensional XY model or the A^- vector models (all A^) in dimension d > 3. 
Here the spin configurations for j3 ^ jSc are rough at short distance scales: for a 
nearest-neighbor pair of sites x, y, we have {(Tx ■ (Ty) < C < 1 uniformly in (3 (~ Pc) 
and L. Thus, the distinction between "regions Ri" and "transition layers" is no 
longer so clear. The couplings {Jxy} will be highly irregular, but we may expect 
Jxy ~ /5c ~ 1 more or less everywhere. On the other hand, for (3 < j3c it remains true 
that the correlations ( p.3|) decay exponentially; so we may again conclude that the 
induced-Ising-model correlations ( |2.4| ) decay exponentially on scale ^ for "nearly all" 
{cr^, |crll|}. In this case, the mechanism of the decay is not the decoupling of regions 
Ri, but simply the fact that the "average" J^y is less than Jc,ising- Irrespective of the 
mechanism, however, it remains true that an independent resampling of the induced 
Ising model will produce a significant collective change on scale ^ (and all smaller 
scales). We may thus continue to expect that z ~ for the idealized Wolff algorithm. 

Remarks. 1. For the two-dimensional XY model near the Kosterlitz-Thouless 
transition, spin waves are not the only important large-scale collective modes: there 
are also large-scale collective modes associated with vortices. Therefore, to explain 
heuristically why 2; -C 2, it is necessary to show also that the (idealized) Wolff updates 
are capable of destroying a widely-separated vortex-antivortex pair. This has been 
done in [Q. 

2. The foregoing reasoning does not apply to a magnetized phase (except in the 
scaling region near the critical point, where the magnetization is very small): the 
trouble is that the spins will be aligned in some particular direction^, and nearly all 
the spins will lie in a single cluster R^ unless the direction of alignment happens to 
be almost perpendicular to r. Therefore, in order for the Wolff algorithm to work 



^In finite volume with free or periodic boundary conditions, this direction of ahgnment is random, 
and will vary from configuration to configuration. Nevertheless, in each individual configuration, 
the spins will be rather strongly aligned in some direction. 
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well in a magnetized phase, it may be necessary to choose the direction vector r with 
a probability distribution that depends on the current spin configuration {cr}. For 
example, one possible algorithm would be to choose at random a site x, and then 
choose r to be a random vector orthogonal to cr^. Since cr^ will be unchanged under 
the subsequent move, this algorithm is easily seen to satisfy detailed balance.^ (If 
N = 2, this choice of r given cr^ is deterministic, and the algorithm is nonergodic: 
for example, if the spins cr^ are all aligned along one axis, they will forever remain 
so. The ergodicity can be restored by admixing, with a nonzero probability, moves 
of the usual type with a randomly chosen r.) The hope here is that r would most of 
the time be chosen approximately perpendicular to the current total magnetization 

= J2x '^xi so that the Goldstone spin waves would be sliced as in Figure |l]. (One 
might inquire about the algorithm in which r is chosen to be perpendicular to M. 
itself. Unfortunately, this algorithm fails to satisfy detailed balance, and we strongly 
suspect that it fails to leave invariant the correct Gibbs measure.) 

3. In a preliminary version of this work p|, we emphasized an additional property 
of T, namely that it is involutive (i.e. its square is the identity map). This property 
ensures that the induced Ising model has zero magnetic field (which is convenient 
although perhaps not essential). In fact, it turns out that an isometry possessing the 
codimension-1 property is automatically involutive: see Theorem |A.9| in Appendix 



Next we would like to argue that the practical Wolff algorithm based on standard 
SW (or single-cluster SW) updates of the induced Ising model (with Nhu = 1) should 
also have z -C 2. We can give two alternative lines of reasoning: 

1) We have already argued that the idealized Wolff algorithm has z ^ 0. On the 
other hand, the induced Ising model is ferromagnetic, and the SW (or ICSW) algo- 
rithm is known to perform well for Ising ferromagnets: the exponential autocorrelation 
time behaves as Texp,sw ~ min(^, L)^'=^P'^'^, where the best currently available esti- 
mates of Zexp^sw (standard SW algorithm) for translation-invariant nearest-neighbor 
ferromagnets are 



- 0.3 for = 2 [9, |n|, |23|, |g ^ 



Zexp,sw ~ { 0.35 - 0.75 for = 3 R |n|, |2|, E§ (2.5) 



ioT d = A 113, HI 



Of course, our induced Ising model is a non-translation-invariant ferromagnet, but the 
performance is likely to be qualitatively similar. Now, N^n hits of the SW algorithm 
are at least 1 — e~^''''/'^'=^P'Sw times as effective as independent sampling in generating 
a "new" Ising configuration. Thus, we expect 

rWolff,practical ^ Twolff,tdeaUzed/ ~ e-^'^''^^^^^''^'^) (2.6a) 
~ TWolff,idealized'Texp,SW / Nhit ^ NhU ^ Texp,SW (2.6b) 

hence Zwolff, practical < Zyyolff,idealized + Zexp,SW ^ 2. 



®The same logic also applies if r is chosen parallel to ctx, as ctx will be unchanged up to a sign 
(and a change of sign of r is irrelevant). However, we do not know of any useful application for this 
algorithm. 
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2) Let us consider the bond configuration that will be generated in the first SW 
(or ICSW) update. If the nearest-neighbor spins (Tx and CTy belong to opposite 
hemispheres — that is, if (cTj, • r){(Ty ■ r) < — then the bond (xy) is guaranteed 
to be unoccupied. It follows that if cr^ and cr^/ are any pair of spins belonging to 
opposite hemispheres, then the sites z and z' are guaranteed to belong to different 
SW clusters. On the other hand, within each connected region Ri consisting of spins 
belonging to the same hemisphere, we expect that the bond occupation probabilities 
Pxy = 1 — e~'^^^y will mostly lie beyond the percolation threshold, so that most of the 
region Ri will belong to the same SW cluster. [This is clear in the asymptotically free 
case, where J^y ~ and Pxy ~ 1- In the finite-/3c case, however, the Jxy are of order 
1, and there is a nonzero probability (uniformly m. (3 jS^ and L) for a small cluster 
of spins to become detached from its surroundings. Thus, each region Ri would have 
the structure of Swiss cheese: there would be one or a few large SW clusters in which 
there live a nonzero density of small SW clusters. Nevertheless, it is reasonable to 
expect that the large clusters will still be of linear size ~ ^, i.e. that the small clusters 
will not be so numerous as to disconnect the large clusters into pieces of size -C ^.] 
Thus, each region Ri will be reflected more or less coherently — but distinct regions 
Ri will be reflected independently — in the first SW update. This corresponds to a 
long- wavelength collective mode (Figure 0). 

2.2 Wolff algorithm for the RP^'^ model 

Let us now consider a second example of a Wolff-type embedding algorithm, namely 
the case of the nonlinear a-model taking values in the real projective space RP^~^, 
where N > 3. This example differs from the A^- vector model in two principal ways: 

1) The equator in 5*^"^ not only has codimension 1, but has the stronger prop- 
erty of dividing the sphere into two disconnected regions (the northern and southern 
hemispheres). That is, 

b') The complement of Fix(T) is disconnected. 

This property implies the codimension-1 property, but is strictly stronger (see The- 



orems 



A.l| and [A.8| in Appendix It is not clear, at first glance, whether our 



heuristic argument in the preceding section was based on (b) or on (b'). Analysis of 
the RP^~^ model, which satisfies (b) but not (b'), will clarify this issue. 

2) In the Wolff embedding algorithm for the A^-vector model, the induced Ising 
model is unfrustrated . [When written in the form ( |1.1| ) / ( pT3D , it is manifestly ferro- 



magnetic; when written in the alternate form ( p.lOD -( p.l2D , it is equivalent via a Z2 



gauge transformation to a ferromagnetic model.] In the embedding algorithm for the 
jlpN-i model, by contrast, the induced Ising model is generically frustrated. It is 
important to know what difference, if any, this makes. 

The real projective space RP^~^ is, by definition, the sphere S^~^ with antipo- 
dal points identified, i.e. RP^~^ = S^'^jZ^- One could now proceed to pick a 
parametrization of RP^~^ (e.g. by embedding it diffeomorphically as a submanifold 
of some M'" H^); but it is simpler and more convenient to consider instead spins 



taking values in the sphere S , subject to the condition that the Hamiltonian and 
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all physical observables must be invariant under the Z2 local gauge transformation 
Vx^^x iVx = The simplest lattice Hamiltonian for this model is therefore 

mcT}) = -^Ei^x-cTy)'. (2.7) 

(xy) 

We use the same embedding as before, namely (|1.1| ), but the coefficients {Jxy} in the 
induced Ising Hamiltonian ( |1.3|) are now 

Jxy = f3{(T^-(T^)\a-x-r\\(Ty-r\. (2.8) 

(For > 3, this is generically a frustrated Ising model.) Since the reflection in the 
equator on S^~^ commutes with the map cr —cr, this reflection also induces a well- 
defined map T on the quotient space RP^^^. Its fixed-point set Fix(T) is the union 
of two disjoint components: one component consists of a single point, namely the 
pole (= the image of the north or south pole in 5*^"^ under the canonical projection 
tt: 5*^^^ RP^~^)] the other component consists of the "equator" in RP^~^ (= the 
image of the equator in S*^"^ under the canonical projection tt), which is isometric 
to RP^~^. Note that the equator in RP^~^ has codimension 1, but (unlike what 
happens in 5*^"^) it does not disconnect the space RP^~^\ this is because in RP^^^ 
there is no distinction between the northern and southern hemispheres. Therefore, 
the map T on RP^~^ satisfies property (b) but not (b'). 

We now wish to analyze the behavior of the idealized Wolff algorithm for the 
^pAf-i jY^odel, by heuristic arguments analogous to those used for the A^-vector model. 
Consider, therefore, a slowly varying spin configuration {cr}. For such a configuration, 
we can imagine the lattice if" to be replaced by a continuous space M*^, and we can 
imagine the configuration {cr} to be a smooth map from into RP^'^. Then the set 
of points in x-space where cr -r = is again (generically) a codimension-1 submanifold 
(= hypersurface) of (see Theorems A.4| and |A.5|) . Since R"^ is simply connected. 



this hypersurface divides R into at least two disconnected regions Ri (Theorem |A.2| ) 



[Alternative argument: Since R'^ is simply connected, the map cr: R'^ RP^~ 
lifts to a continuous map or: R'' S^~^ satisfying cr = vr o or. Now make the 
same argument as for the iV-vector model, but using or in place of cr.] Therefore, 
by the same arguments as in the A^-vector case, the induced Ising Hamiltonian ( |2.8| ) 
almost decouples along the surfaces where cr (or equivalently or) lies on or near the 
equator, and under an idealized Wolff update the regions Ri will be updated almost 
independently. We thus predict that the idealized Wolff algorithm for the RP^~^ 
model (in a non- magnetized phase) will have dynamic critical exponent z <^2, and 
quite possibly z ~ 0. 

Remark. If x-space is not simply connected (e.g. X is the torus T'^ if periodic 
boundary conditions are used), then the foregoing argument fails: a single hypersur- 
face need not disconnect X . However, a few hypersurfaces suffice to disconnect X 
(see Theorem |A.3| ); and if 1 -C ^ < L/4, then it is natural to expect that, with rea- 
sonable probability, the set where cr ■ r = will have several connected components. 
So we expect the algorithm to have the same dynamic critical exponent irrespective 
of the boundary conditions used. At worst, the autocorrelation time might be larger 
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by a constant factor for periodic as compared with free boundary conditions, for the 
same lattice size. 

As in the A^-vector case, it is necessary to inquire more closely into the assumption 
that {cr} is slowly varying. Consider, as before, the two cases: 

(a) The critical point is at zero temperature. This case corresponds to asymp- 
totically free theories; low-temperature perturbation theory suggests that the two- 
dimensional RP'^'^ models for > 3 are asymptotically free, but the Monte Carlo 



evidence is ambiguous (see p9| for references). As /3 — >• /3c = oo, the typical spin 
configurations become smooth modulo a sign: for a nearest-neighbor pair of sites 
X, y, we have ((cr^. ■ (Ty)"^) ^ 1 — c//3 for a (calculable) constant c. We now claim that 
the couplings {Jxy} have the following properties: 

(i) In regions where cr is not near the equator or the pole (i.e. deep inside one of 
the regions Ri), the couplings {Jxy} are of order (3 and globally unfrustrated . 

(ii) In regions where cr is near the equator (i.e. the transition layers between the 
regions -Rj), the couplings {Jxy} are of order 1 and locally unfrustrated, though 
they may be globally frustrated. 

(iii) In regions where cr is near the pole (these are isolated patches), the couplings 
{Jxy} are of order 1 and may be frustrated. 

The statements concerning the typical magnitudes of Jxy are demonstrated exactly 
as in the A^- vector case, using (|2.8|) . The statements concerning local frustration 
are proven as follows: Let Xi, . . . , x„, x„+i = a;i be a (small) closed path of nearest- 
neighbor sites such that the spins cr^^ , . . . , cTxn are all close to each other and not 
near the pole, i.e. there exists C > | such that |cr^^ ■ cr^^| > C and Icr^,. ■ r| < \/C for 

all ^5 J 1, . . . , 7Z. Then Jcycle — JxiX2Jx2X3 ' ' ' Jxn—lXnJxnXi ^ 

To study the global frustration, define Jxy = ■ (^y){(^x ■ r)(cry • r) [these 

couplings are equivalent to {Jxy} via a Z2 gauge transformation]. Now draw a bond 
between any nearest-neighbor pair of sites (xy) for which Jxy > 0. This will occur, in 
particular, if cTx and cTy are close to each other and not near the pole or the equator.]^ 
The connected components formed by the bonds thus drawn correspond roughly to 
the regions Rf, it is reasonable to expect that they are of linear size ~ ^, although they 

^ Proof: Note first that changing ctj,. to — cr^. leaves Jcycie unchanged; therefore we may choose 
the signs of c^^, . . . , cr^.^ so that cr^i • (Tx^ > C > for i = 2, . . . , n. Since C > i, this means 
that the angle between a^^ and (Tx^ is less than 60°; so the angle between cr^i and is less 
than 120°, i.e. cr^. ■ cr^. > — i. But jcTj,. • cr^^l > C > ^ by hypothesis, so cr^. • cr^^ > C. Then 

^x, ■ <^x, = <^x, ■ J - (cTx, ■ r)(crxj • r) > C - (n/C)^ = 0. So each coupling Jx,x,+i is > 0. 

^°More precisely, suppose that there exists e > such that \(Tx ■ cry\ > 1 — e and < jcTj. • 



r|, \(Ty ■ r| < — e. Then we claim that Jxy > 0. Proof: Since the substitutions cTx —cTx and 
(Ty ^ —CTy leave J^y unchanged, we can assume without loss of generality that cr > 0. 

Then cTx ■ a-y = cr^ ■ cTy + {cTx ■ r)(cy • r) > — 1 + (\/e)^ = —(1 — e), so we must have (Tx - o-y > 1 — e. 
Then er^ ■ (t:I^ ^ (Tx ■ (Ty - (cr^ • r){cry • r) > (1 - e) - (Vl - e)^ = 0. So Jxy > 0. 
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could in principle be larger.]^ Within each such connected component, the induced 
Ising model is globally unfrustrated: indeed, for any closed path of nearest-neighbor 
bonds (xi,X2), (x2,X3), . . . , (x„_i,x„), (x„,x„+i = Xi) such that Jx,x,+-, > for all i, 

we trivially have J^yde = U7=i Jx^xi+i = IlILi Jx,x,+i > 0. 

What is the phase structure of this induced Ising model? Here we cannot argue 
as we did in the A^- vector case [cf. (P7^)/(P^)], because the global frustration of the 



{Jxy} in the transition layers could cause "cancellations". Instead, we can distinguish 
at least three a priori possibilities: 

(i) The induced Ising model exhibits exponentially decaying correlations on a char- 
acteristic scale ^ (or smaller). 

(ii) The induced Ising model lies in a spin-glass phase of Parisi-Mezard-Sourlas- 



Toulouse-Virasoro type [pO| , |32| , |33| , i.e. there are many "dominant" regions 
of configuration space which differ on long as well as short length scales ( "rough 
free-energy landscape" ) . 

(iii) The induced Ising model lies in a spin-glass phase of Fisher-Huse type p4 



I.e. 

there are only two "dominant" regions of configuration space, related to each 
other by a global spin flip. 

In cases (i) and (ii), an independent resampling of the induced Ising model will pro- 
duce a significant collective change on scale ^ (and all smaller scales), so we expect 
2; for the idealized Wolff algorithm. In case (iii), by contrast, the idealized Wolff 
updates do essentially nothing (merely a global spin flip), and the performance should 
be poor. 

In dimension d = 2, it is believed that there is no spin-glass phase for the ordinary 



random-bond Ising model ||35[. That does not imply anything, of course, for our 
induced Ising model, in which the structure of the couplings is quite different, but it 
does suggest that scenarios (ii) and (iii) are unlikely, and that scenario (i) is probably 
the correct one. We therefore expect that z ~ for the idealized Wolff algorithm. 
This prediction has been confirmed in a preliminary study ||29| of the two-dimensional 
RP^ model. 

(b) The critical point is at finite temperature. This case is expected to hold in 
the RP^^^ models (all A^) in dimension d > 3, whenever the phase transition is 
continuous (as opposed to first-order). Unfortunately, in this case we have little to 
say: the typical spin configurations for j3 ^ jSc are rough at short distance scales, and 
the induced Ising model will be highly frustrated. Any one of the scenarios (i)-(iii) 
just discussed — and possibly others as well — could occur. So we are unable to 

There is no topological obstacle to these connected components being of size much larger than 
^ (e.g. spanning the whole lattice). Indeed, any two spin values cr^,, cTx' not lying on the equator or 
the pole can potentially be connected by a chain of spins ctx = (Tx„,crxn ■ ■ ■ tCTx^. = ctx' such that 
JxiXi+i = {'^xi''^xi-i-i){'^xi-^){cxi-i-i-^) > for alii. Proof: Assume without loss of generality that 0-3; 
and cTx' both lie in the northern hemisphere, with longitude((T2;) = 0°. Then, if |longitude((T3;/)| < 
90°, we can take fc = 1; if 90° < |longitude(<Ta;')l < 180°, we can take k = 2; and if |longitude((T2;')| = 
180°, we can take k = 3. 
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predict with confidence whether the ideahzed Wolff algorithm will exhibit 2 ^ or 
not. 

The behavior of the practical Wolff algorithm based on standard SW (or single- 
cluster SW) updates is even less clear. Although the induced Ising Hamiltonian ( |2.8| ) 
is frustrated, this does not necessarily mean that the SW algorithm will perform 
poorly for it. To see this, note first that if Jxy = P{o-x ' ^y)i^x ■ ^){<^y ■ r) < 0, then 
the bond {xy) is guaranteed to be unoccupied. If such bonds are numerous enough 
to divide the lattice into at least two large clusters Ri of size ~ ^, then the SW 
algorithm will flip these clusters independently — and thus make a long- wavelength 
collective-mode change in {cr} — irrespective of the frustration within each Ri. On 
the other hand, there is no topological guarantee that the lattice will be divided in 
this way (unlike what happens in the iV- vector case); the clusters Ri might have size 
much larger than ^, in which case the flips would not be of much use.|3 This whole 
question deserves further study P5| . 



2.3 Generalized WolfF-type embedding algorithms 

It is now clear how to generalize this reasoning to a nonlinear a-model taking values 
in an arbitrary compact Riemannian manifold M. Let us assume that the lattice 
Hamiltonian is of the form 

H{{rT}) = /3^E{cT,,cTy) (2.9) 

where E{a, cr') = E{a', cr) ~ const -|-c?(cr, cr')^ as cr' ^ cr (here d is geodesic distance 
in M).0 Now let T: M ^ M be any map satisfying: 

a) T is energy-preserving, i.e. E{Tcr,Tcr') = E{cr,cr') for all cr, cr' G M. [In 
particular, by taking cr' cr it follows that T preserves the metric tensor, and 
is therefore an isometry.] 

We can then define an embedding of Ising spins {e} by the rule 

^ I 1 V —.new —old 



1 v —new 'T—.ol 

— i =^ cr^ = I cr^. 



old (2.10) 



^^In the iV-vector model, spins cTx and a^' belonging to different hemispheres [i.e. for which 
(<T • r)(<Ta;/ • r) < 0] are guaranteed to belong to distinct SW clusters. In the RP^~^ model, by 
contrast, any two spins <t^, cTx' not lying on the equator or the pole can potentially be connected by 
a chain of spins cr^ = (T.xg,crxi,. ■ ■ , cr^^ = cr^/ such that JxiXi+i = (cr^. • cr^^^ J(crj.^ •r)(cr^.^j -r) > 
for all i: see the preceding footnote. Thus, er^. and cTx' could belong to the same SW cluster. 

^•^More generally, we could sum over oriented bonds {x, y) and allow E to be non-symmetric [i.e. 
E{cr,(T') E{(T',cr)]: this situation arises, for example, in an A^- vector model in a fixed external 
gauge field (in particular, in a frustrated A^- vector mod el). However, only the symmetric part of E 
contributes to the induced Ising Hamiltonian [see ([2!^ ) below]. 
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so that the induced Ising Hamiltonian is 



(xy) {xy) 

where 

Jxy = ^[EiTcT,„CTy) + E{(T,,T(Ty) -2Ei(T,,(Ty)] (2.12a) 
hxy = j[E{T(T^,(Ty)- E{(T,,T(Ty)] (2.12b) 

(For the original Wolff algorithm, this definition of {e} differs from ( |1 . 1| ) by a Z2 gauge 
transformation.) In particular, if T is involutive, then E[Tcrx, cry) = E{(Tx,T(Ty), so 
that the induced Ising Hamiltonian has zero magnetic field: 

Jxy = ^[E{ax,Tcry) - E{(Tx,(Ty)] (2.13a) 
Ky = (2.13b) 

We remark that in general the couplings ( |2.13| a) are frustrated; indeed, in Appendix ^ 
we prove that non-frustration occurs essentially only in the codimension-1 algorithm 
for the A^-vector model, i.e. the original Wolff algorithm. (See Theorem |A.1U| and 
Corollaries |A.11| and [A.12| for details.) 

A Wolff-type embedding algorithm is now specified by: 

(i) a collection T of energy-preserving maps of M; and 

(ii) a probability distribution p on T. 

One step of the algorithm consists of the following operations: 

(i) Choose randomly (with probability distribution p) a map T E T. 

(ii) Initialize {e} = +1. 

(iii) Update the induced Ising model ( p.ll| )/( p.l2| ), using any chosen Monte Carlo 
algorithm (e.g. standard SW). 

(iv) Update the {cr} according to ( [2.101 ). 

If the group generated by T acts transitively on M, then (under mild conditions on p) 
this algorithm will be ergodic. If not, then the foregoing moves must be supplemented 
by other types of updates (e.g. a local heat-bath or Metropolis update of the {cr}) so 
as to make the combined algorithm ergodic. 

The idealized embedding algorithm is, by definition, the one in which step (iii) 
consists of obtaining an independent sample from the induced Ising model. 

Typically T is obtained from a single map T by conjugating it with isometrics 
of M, and p is taken to be an invariant measure; this is the case, for example, in 
the original Wolff algorithm, with its random choice of the unit vector r. We shall 
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henceforth assume T to be of this form, and shall refer to "the" map T as a shorthand 
for "any one of the maps T G T" . 

In summary, the embedding algorithm can be defined for any collection T of 
maps which are energy-preserving. But we claim that the algorithm will be successful 
in eliminating or at least radically reducing the critical slowing-down — i.e. have 
a dynamic critical exponent z significantly smaller than the z ^ 2 typical of local 
algorithms — only if a second property is satisfied: 

b) The fixed-point set Fix(T) = {<t G M: Tcr = cr} has codimension 1. 

Let us now set out, in detail, our reasoning supporting this claim: 

1) In order for an algorithm to do a good job of beating critical slowing-down (i.e. 
have dynamic critical exponent 2 <^ 2), it is necessary that the algorithm be 
capable of making quickly (i.e. in one or a very few time steps) a significant 
change in all of the large-scale collective modes that are relevant for the (static) 
critical behavior of the given model. 

2) Among the important large-scale collective modes in the nonlinear a-models are 
the long-wavelength spin waves. (There may also be other important modes, 
such as vortices in the two-dimensional XY model; but there are in any case at 
least spin waves.) 

3) It follows that for the algorithm to have z <^2, it is necessary that it do a good 
job of handling long- wavelength spin waves. (This may not be sufficient, if other 
types of collective modes are also important. For example, the multi-grid Monte 
Carlo algorithm |T^, does an excellent job of handling long- wavelength spin 



waves, but does less well in handling vortices; as a result, it eliminates critical 
slowing-down in the low-temperature (spin-wave) phase of the two-dimensional 
XY model, but not on the high-temperature side of criticality [jl8|. The same 
appears to be true of Fourier acceleration p2| .) 



4) An idealized Wolff-type embedding algorithm does a good job of handling spin 
waves if and only if it causes x-space to be divided into two or more large 
disconnected regions Ri which are almost decoupled from each other in the 
induced Ising Hamiltonian. The same holds true for a practical Wolff-type 
embedding algorithm, with "if and only if" replaced by "only if" . 

5) The induced Ising Hamiltonian decouples along the manifold {x: cr(x) feFix(T)} 
{x: cr{x) ~ Tcr(x)}, and only there^ 

6) For a manifold to divide x-space into disconnected regions, it is necessary that 
the manifold be of codimension or 1 (see Theorem "Kl in Appendix 0). 



^"^Proof: Assume that E{(t, cr') has a unique minimum for fixed cr when cr' ^ a (and hence 
also for fixed cr' when cr — cr'). Then consider a bond {xy): we have cr^ ~ cry, since the original 
configuration is assumed smooth. Then Jxy « if and only if E{Tcrx, cry) and E{crx,Tcry) are 
both w E{crx,cry) « mini?. It follows that TcTx « cTx (and Tcr^ « ay) by the uniqueness of the 
minimum of E. 
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7) For a "generic" smooth spin configuration cr: — >■ M, the codimension of 
the set {x: (t{x) G Fix(T)} in x-space equals the codimension of Fix(T) in the 
target manifold M (see Theorems [A.4| and |A.5| ). 



Fix(T) cannot have codimension except in the trivial case Fix(T) = M (see 
Theorem [A. 81). 



9) Conclusion: For a Wolff-type embedding algorithm to have ^ -C 2, it is nec- 
essary (though perhaps not sufficient) that Fix(T) have codimension 1 in the 
target manifold M. 

In our opinion, the only truly questionable step in this reasoning — at least as 
regards asymptotically free models — is Step 4. We have explained previously why 
an idealized Wolff-type embedding algorithm does a good job of handling spin waves 
if it causes x-space to be divided into large disconnected regions which are almost 
decoupled from each other. It is a substantial — and quite possibly unjustified — leap 
to assert that this is the only mechanism by which an idealized Wolff-type embedding 
algorithm can do a good job of handling spin waves. For example, suppose that the 
induced Ising Hamiltonian were to lie in a highly frustrated phase characterized by 
several "dominant" regions of configuration space which differ on long as well as 
short length scales. Then the corresponding idealized Wolff-type algorithm would, by 
definition, make frequent transitions between these regions; in particular, it would 
make frequent global changes in the {e} configuration, and hence presumably also in 
the {cr} configuration. These global changes might be sufficient for the algorithm to 
do a good job of handling spin waves.[^ (Of course, a practical Wolff-type algorithm 
might have great difficulty surmounting the barriers between these dominant regions 
of configuration space. But that is not our concern at present.) 

Since our reasoning is based on considering "smooth" spin configurations, it is also 
questionable when applied to non-asymptotically-free models, as already discussed in 



Sections and |2.2| . Possibly our reasoning does become valid when reinterpreted as 
referring to a suitably coarse-grained spin field; but possibly not. Some insight into 
this problem could perhaps be obtained by comparing the (ideahzed) codimension-1 
and codimension-2 algorithms [cf. ( |3.1| ) for r = 1,2] for the t/iree- dimensional A^- 
vector model. However, it will be difficult to disentangle the effects of codimension 
from those of frustration, as the two phenomena go together (codimension 1 is un- 
frustrated, while codimension > 1 is frustrated). In this regard the three-dimensional 
jlpN-i ]2iodel (if its phase transition is indeed continuous) could serve as a valu- 
able comparison, since in this case both codimension 1 and codimension > 1 lead to 
frustrated induced Ising models. 

All in all, we consider our Conclusion to be a plausible conjecture, which needs to 
be tested carefully. In Section ^ we test it in one asymptotically free model. 
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3 Classification of (Involutive) Isometries 



The reasoning of the preceding section leads us to the following problem in differential 
geometry: Given a Riemannian manifold M, classify all the isometries of M according 
to the codimension(s) of their fixed-point manifolds. In particular, if we are interested 
in fixed-point manifolds of codimension 1, we can restrict attention to involutive 
isometries (see Theorem [A.9|) . 

If T is an isometry (resp. involutive isometry) of M, and / is any isometry of M, 
then f oTo f^^ is also an isometry (resp. involutive isometry) of M; we say that it is 
conjugate to T via the isometry /. Obviously, conjugacy is an equivalence relation; 
for any given manifold M, our goal will be to classify the (involutive) isometries of 
M, modulo conjugacy. Note that if p is a fixed point of T, then f{p) is a fixed 
point of / o T o /~^. In particular, if Fix(T) is nonempty and the isometry group 
Isom(M) acts transitively on M , then the conjugating map / can be chosen so that 
Fix(/oTo/"^) passes through any desired point g e M (though not necessarily in any 
desired direction). So we can restrict attention, if desired, to (involutive) isometries 
T that leave fixed some chosen point p. 

3.1 Some Examples 

Example 1. M = 5^^^. All involutive isometries of S*^"^ are conjugate (via an 
orthogonal transformation) to 

T(a\...,0 = (-a^...,-a^a^+^...,0 (3.1) 

for some r (0 < r < A^).Q Henceforth we shall write this transformation more 
compactly as cr — J^cr, where 

Ir = diag( -l,..^.,-l , +1,...,+1 ) . (3.2) 
r times N — r times 

For < r < — 1 the fixed-point manifold has codimension r; for r = N the fixed- 
point manifold is empty. (More precisely, for < r < — 2 the fixed-point manifold 
is isometric to S^~^~^] for r = A^ — 1 it consists of two points.) The standard Wolff 
reflection ( |2.2| ) corresponds to the case r = 1. 

Example 2. M = RP^-\ Every isometry T of RP^-^ = S^-^/{±I} "lifts" 
(nonuniquely) to an isometry T of S*^"^ [hence a matrix T G 0{N)]] and conversely. 

Proof: Every isometry of S^"^ is induced by an orthogonal matrix T G 0{N). An involutive 
isometry has = /. It follows that T is symmetric as well as orthogonal, and hence has eigenvalues 
±1. Any such matrix can be diagonalized by a rotation; and by a further permutation of coordinates 
(which is also an orthogonal transformation) we can make all the —1 eigenvalues come first. Thus, 
there exists a matrix R e 0{N) such that RTR~^ = Ir = diag(— 1, . . . , —1, +1, . . . , +1 ) for some r 

r times N — r times 

(0 < r < N). 
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every isometry of S^~^ commutes with ±J and hence induces a (unique) isometry on 
RP^~^. Moreover, T is involutive if and only if = ±J. We must therefore analyze 
two cases: 

(a) If = +J, then T is conjugate [via an orthogonal transformation R G 0(A^)] 
to a transformation of the form ( |3.1| ) for some r. Moreover, since R commutes with 
±/, this conjugacy acts also on RP^"^, i.e. T is conjugate to the map induced on 

(b) If = -/, then must be even [since det(T2) = (detT)^ = +1]; and T is 
easily seen to be conjugate [via some R G 0(A^)] to the map cr Jcr, where 



and / is an (A^/2) x (A^/2) identity matrix.[] 

Thus, T is conjugate to the map induced on RP^^^ by one of the following: 

(aj.) cr — i> IrCr. [Note that (a^) and (aM-r) are the same transformation.] The 
fixed-point set is the union of two disjoint connected manifolds, one isometric 
to RP^~^~^ and the other isometric to RP^~^; they have codimensions r and 
N — r, respectively. 

(b) [Only for N even] cr — > Jcr. The fixed-point manifold is empty. 

Example 3. M = CP^^^ . The complex projective space CP^^^ is, by definition, 
the set of unit vectors in modulo the equivalence relation 

z ^ z' <^=^ z = e^^z' for some real number 9 . (3-4) 

There is a unique (up to multiples) Riemannian metric on CP^~^ that is invariant 
under the natural action of U{N), namely the Fubini-Study metric [^B], vol. II, pp. 



159-160, 169]. The geodesic distance associated with this Riemannian metric isf^ 



d{z,w) = 2 cos ^ 



N 



1=1 



(3.5) 



(where, by abuse of language, we do not distinguish between an equivalence class 
z G CP^~^ and a representative unit vector z G C^). The isometries of CP^~^ 
{N > 2) are fully classified by Wigner's theorem: every such isometry arises from 
a map which is either unitary or antiunitary.|3 Thus, any isometry of 

CP^-^ is of one of the forms 



Proof: Since T^T = / and = —I, we conclude that T is antisymmetric as well as orthogonal, 
and hence has eigenvalues ±i. Any such matrix can be brought into 2x2 block-diagonal form by a 
rotation; and this is equivalent, by a permutation of coordinates, to J. Thus, there exists R G 0{N) 
such that RTR-^ = J. 

^^This is easily demonstrated by integrating the Fubini-Study metric along the geodesic curves in 
CP^^^ vol. II, p. 277], and appealing to unitary invariance. 

^^See e.g. |3 or [||, pp. 305-306]. 
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(a) z^Uz with U G U{N) 

(b) z^Uz with U G U{N) 

A httle analysis then shows that every involutive isometry of CP^^^ is conjugate to 
one of the following]^: 

(a^) z — > IrZ. [Note that (a,.) and {&N-r) are the same transformation.] The fixed- 
point set is the union of two disjoint connected manifolds, one isometric to 
(jpN-r-i ^YiQ other isometric to CP^'~^; they have codimensions 2r and 
2(iV — r), respectively. 

(bl) z ^ z. The fixed-point manifold is connected and isometric to RP^^^] it has 
CO dimension — 1. 

(b2) [Only for even] z Jz. The fixed-point manifold is empty. 

We conclude that in CP^~^ (N > ?>), there do not exist involutive isometries of 
codimension 1. (For N = 2, CP^ is isometric to the sphere 5*^, so an involutive 
isometry of codimension 1 of course does exist, namely z ^ z.) An even stronger 
nonexistence result is mentioned in Further Remarks 3 at the end of Appendix [A. 3. 



Example 4. M = SU{2). Since SU{2) is isometric to via the map 

(3.6) 



(7° + ia^ cr^ + ia^ 
— cr^ + ia^ (T° — ia'^ 



where t^,t^,t'^ are the usual Pauli matrices, this is a special case of Example 1. 
But it is useful to translate those results into the SU{2) matrix language, as a hint 

^^Proof: Let T be an involutive isometry of CP^~^. In case (a), we must have — e*"J, and 
by redefining the phases we can require — I. It follows that the eigenvalues of U are ±1, so 
there exists V G SU{N) such that U = VlrV^ for some r [0 < r < N). Thus T is conjugate to 
the map z IrZ. In case (b), we must have UU — e'"/, hence U — e*"C/"^. Since U ^ 0, the only 
possibilities are e'" = ±1; and by taking determinants we see that e*" = — 1 can occur only for N 



even. If UU — I, then U = XX^ for some X £ U{N) [see the proof of case (cl) in Theorem B.4]; 



but then, taking f{z) = XU, we have {foTof-^){z) = z. Similarly, iiUU = then U ^ XJX^ 



for some X e U{N) [see the proof of case (c2) in Theorem B.4|; but then, taking f{z) — X''z, we 
have (/oro/-i)(z) = Jz. 

Let us now compute the fixed-point manifolds F: Case (a): F consists of points z satisfying 
z = e^'^IrZ for some a. For a ^ 0, tt there arc no z ^ which satisfy this equation. Thus F consists 
of z which satisfy either z = IrZ or z = —IrZ. These equations define disjoint connected manifolds 
isometric to CP^~'^~^ and CP^^^, respectively. Case (bl): F consists of points z satisfying z = e'"z 
for some a, or equivalently ze~*"/^ = ze"*"/^. Redefining phases, we can assume that a — 0. So F 
is a connected manifold isometric to RP'^^^. Case (b2): F consists of points z satisfying z — e^°'Jz 
for some a. It follows that z — e~'" Jz = J^z = — z, which is clearly impossible for z ^ Q. So the 
fixed-point manifold is empty. 

See also [^, Theorem 7] and [^, Theorem 4.2] (case AIII with p = N — 1, q = 1) for an alternate 
derivation of this list, based on a Lie- algebraic analysis. 
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toward generalizations to SU{N). Recalling the definitions Ii 
f 1 ^ 



-1 
1 



and 



J 



-1 



here is a (redundant) list of some involutive isometries of SU{2): 



Codimension 0: A ^ A {= -JAJ) Fixed points: All SU{2) 



Co dimension 1: A 

A 



-At 

hA^h 
A^A^ 
A hA^i 



Fixed points: 0"° = 

Fixed points: = 

Fixed points: = 

Fixed points: = 



Codimension 2: A 
A 



A 

hAh 
etc. 



Fixed points: cr^ = cr^ 
Fixed points: = cr^ 








Codimension 3: A ^ A^ (= —JA^J) Fixed points: = a"^ = = 

etc. 



Example 5. M = SU{N). In Appendix Q we carry out a complete classifi- 
cation (modulo conjugation by an isometry) of the involutive isometries of SU{N). 
[Surprisingly, we have been unable to find this classification anywhere in the math- 
ematical or physical literature.] In particular, the involutive isometries of SU{N) 
having nonempty fixed-point manifold are the following^: 

{a.r,r) A IrAIr- [Note that {a.r,r) and {aN^r,N~r) are the same transformation.] 

The fixed points are matrices of the form with B G U{r), C G U{N — r) 

and (det i?)(det C) = 1. We call this space S{U{r-) x U{N — r)); it is a subgroup of 
SU{N), and has codimension 2r(A^ — r). 

(bl) A — > At. The fixed-point manifold is 

U 

0<r < N 
r even 

where = {f//,f/t: U G SU{N)} is the coset space SU{N)/S{U{r) x U{N - r)); it 
has codimension -|- {N — r)^ — 1. 

(b2) [Only for N even] A — > e^'^^/^ . The fixed-point manifold is 

U 

0<r < N 
r odd 



^^See also Theorem 3.3] for an alternate derivation of this list, based on a Lie-algebraic 
analysis. 
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where Fr = {e^'/^UIrW: U G SU{N)} is isometric to the coset space SU{N)/S{U{r) x 
U{N — t)); it has codimension + (A^ — r)^ — 1. 

(cl) A —>■ A. The fixed points are real matrices A G SO{N). This space is a 
subgroup of SU{N), and has codimension |(A^^ + — 2). 

(c2a) [Only for N even] A —>■ —jAj. The fixed points are matrices A that 
are symplectic {A^JA = J) as well as unitary. We call this space[^ USp{N/2) = 
Sp{N/2, C) n U{N); it is a subgroup of SU{N), and has codimension ^{N"^ - N -2). 



(dl) A . The fixed points are matrices A = UU^ with U G SU{N). This is 

the coset space SU{N)/ SO{N); it has codimension |(A^^ — A^). 

(d2) [Only for even] A —A^. The fixed points are matrices A = UJU'^ or 
A = e^'^'^'^UJU'^ with U G SU{N). These two disjoint manifolds are each isometric 
to the coset space SU{N)/ USp{N/2); they have codimension j{N^ + N). 

Therefore, in SU{3) the involutive isometrics (other than the identity map) having 
nonempty fixed-point manifold are: 

Codimension 3: A ^ Fixed points: SU{3)/SO{3) 

Codimension 4: A^ hAh Fixed points: S{U{1) x U{2)) 

A^ A^ Fixed points: SU{3)/S{U{1) x U{2)) 

Codimension 5: A A Fixed points: 5*0(3) 
In SU (4) the non-identity involutive isometrics of smallest codimension are 
Codimension 5: A — JAJ Fixed points: USp{2) 

Codimension 6: A ^ hAh Fixed points: S{U{l)xU{3)) 
A^A^ Fixed points: SU{A)/SO{A) 

In SU{N), N > 5, the non-identity involutive isometry of smallest codimension is 
A IiAIi, with codimension 2A^ — 2. 

We conclude that in SU{N), N > 3, there do not exist involutive isometries of 
codimension 1 (or even codimension 2). 



3.2 General Theory 



Now let us try to find all Riemannian manifolds (within a certain class to be defined 
below) admitting an involutive isometry with fixed-point manifold of codimension 1. 
(Such an involutive isometry is sometimes called a mirror.) We need a few definitions 
and facts from differential geometry [^6], ^ ^ : 



^^See e.g. pp. 347-349]. In the differential-geometry literature (e.g. |42, p. 340]) this group is 
often called Sp{N/2). 
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A Riemannian manifold is called homogeneous if it possesses a transitive group 
of isometries [i.e. if for all p,q & M there exists an isometry / such that f{p) = q]. 
Every homogeneous Riemannian manifold is complete. 

A Riemannian manifold is called symmetric (or globally symmetric, or a symmetric 
space) if for each p & M there exists an involutive isometry of M having p as an 
isolated fixed point. Every Riemannian symmetric space is homogeneous, and the 
Riemannian symmetric spaces have been completely classified Theorems 8.11.4 
and 8.3.12]. Virtually all the target manifolds arising in physical applications are 
symmetric spaces (The only exception we know of arises in the a- model 

approach to string theory |^ in which general target spaces are needed.) 

Let M be a Riemannian manifold with metric tensor g and curvature tensor R. 
Then, for any point p & M and any two-dimensional vector subspace S of the tangent 
space TpM, the sectional curvature at p along S is defined to be 

K{p S) = 9p{Rp{v,w)w,v) 

gp{v,v)gp{w,w) - gp{v,wy' 

where v and w are any two linearly independent vectors in 5*. If K{p, S) = k for all 
p & M and all planes S C TpM, then M is called a space of constant curvature kQf^ 
Much is known about spaces of constant curvature: see [Q, and especially Corollary 
2.4.10, Theorem 2.5.1, Theorem 2.7.1 and Corollary 2.7.2. For our purposes, the 
following results suffice: 

Theorem 3.1 Let M be a connected homogeneous Riemannian manifold of dimen- 
sion n and constant curvature k. Then either 



(a) k < and M is isometric to the hyperbolic space H"^ of "radius" k ^1"^ 



in M'^+^ 



or 



(bra) [0 < m < n] k = and M is isometric to the product R™ x T""™" of a Euclidean 
space with a fiat Riemannian torus T"~™ = MP~''^/T (here T is a discrete sub- 
group of the translation group R"""*, generated by n — m linearly independent 
vectors) 

or else 

(c) k > and M is isometric to one of the following: 
(i) the sphere of radius k^^^"^ in R"^^ 



^•^In fact, if dim A/ > 3 it suffices that K{p, S) be independent of S for each p e M\ then a theorem 
of Schur [^8[ Theorem 2.2.7] ensures that K(j>^ S) is constant also as a function of p. 

^"^This should not be confused with the constancy of the curvature tensor (or of the sectional cur- 
vature) under parallel translation. Indeed, these latter conditions provide two alternative definitions 
of the Riemannian locally symmetric spaces p8| , pp. Theorem 8.1.1]. At least one book makes 
the unfortunate decision to use the term "space of constant curvature" for what everyone else calls 
"Riemannian locally symmetric space" . 
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(a) the real projective space RP" = ^"/{i/} 

(in) [only for n odd] the quotient space S"'/Zm withm > 2; here S"" is considered 
as the unit sphere of C^"^^^^^"^ , and the group Zm C U{1) acts by scalar 
multiplication 

(iv) [only for n = 4Z + 3, / integer] one of the quotient spaces S'^/D^ with 
m > 2, S"'/T*, S'^/0* or S^/I*; here is considered as the unit sphere 
q("-+i)/4 _ quaternions), and D^, T* , O* and I* are the dihedral, 
tetrahedral, octahedral and icosahedral groups lifted to SU{2) ~ ?7(1,Q) 
and acting by scalar multiplication 

In particular, M is compact in cases (b^) and (c), and only these. 



Proof. This is an immediate consequence of Theorem 2.7.1 and Corollary 2.7.2]. 



Corollary 3.2 Let M be a connected Riemannian symmetric space of dimension n 
and constant curvature k. Then either 

(a) k < and M is isometric to the hyperbolic space iJ" 

or 

(bm) [0 < m < n] k = and M is isometric to a product R™' x T"'~"^ 
or else 

(c) k > and M is isometric to either the sphere or the real projective space 

In particular, M is compact in cases (bo) and (c), and only these. 

Proof. Since every Riemannian symmetric space is homogeneous, it suffices to ana- 
lyze the cases in Theorem |3.1| . By []38| , Theorem 8.3.12], the quotient S^/T is a sym- 



metric space if and only if F is a discrete subgroup of A = centralizer of SO{n + 1) 
in 0(n+ 1) = {±J}. I 

A connected Riemannian manifold is said to be irreducible if it is not locally 
isometric to a product of lower-dimensional manifolds. Equivalently, M is irreducible 
if for some (or all) p ^ M the linear holonomy group acts irreducibly on the 
tangent space TpM. 

The irreducible Riemannian symmetric spaces admitting a mirror have been com- 
pletely classified by Iwahori 



Theorem 3.3 Let M be an irreducible Riemannian symmetric space of dimension 
n. Then the following are equivalent: 
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(a) M admits an involutive isometry whose fixed-point manifold has at least one 
connected component of codimension 1. 



(b) M is a space of constant curvature. 

(c) M is isometric to either the sphere 5*", the real projective space RP^ = 5"/ {±/}, 
or the hyperbolic space H^. 

[Here (b) <^==^ (c) follows from Corollary p.2| , and (c) =^ (a) is easy. The hard part 
of the proof is (a) =^ (b).] 

Thus, the only irreducible compact Riemannian symmetric spaces admitting a 
mirror are the examples we already know: the sphere 5*" and the real projective 
space RP^. 

Remarks. 1. It is, as far as we know, an open question whether (a) =^ (b) in 
Theorem |3.3| holds also for irreducible homogeneous Riemannian manifolds that are 



not symmetric spaces. 

2. Some relevant related work can be found in |H9|, pOl; unfortunately, we have 



not yet been able to obtain a copy of these articles. For related work on reflections 
in complex manifolds, see |5! 



Next we would like to classify the reducible Riemannian symmetric spaces ad- 
mitting a mirror (or more interestingly, admitting a transitive group of mirrors). 
However, since cr-models taking values in a reducible Riemannian symmetric space 
are of lesser interest for physical applications, we leave this classification for the math- 
ematicians. But it is not difficult to show that a transitive group of mirrors can exist 
only if M is a sphere, a product of spheres, or the quotient of such a space by a 
discrete group Q 



4 Numerical Study of Embedding Algorithms for 
the 0(4) Model 

In Section ^ we conjectured that the idealized embedding algorithm can perform well 
(i.e. have z <^ 2) only if it is based on a involutive isometry T whose fixed-point 
manifold has codimension 1. In order to test this conjecture, we have studied the 
two-dimensional A^-vector model for = 4 using the embedding algorithm based 
on a codimension-2 refiection [i.e. with r = 2]. For comparison, we have also 

generated additional data (extending ||^) on this same model using the codimension-1 
refiection. 



^^One amusing application involves a cr-model taking values in the space x (i.e. two uncoupled 
XY models): in addition to the standard WolfF algorithm based on the codimension-1 reflections 
9^—9 and </)—>—(/), there is another algorithm based on the codimension-1 reflections 9^4) and 
9 —(f>. Physically, this algorithm takes chunks from one system and pastes them into the other 
system (possibly reflected). We conjecture that this algorithm will also have z ~ 0. 
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4.1 Quantities to be Measured 

For any observable A, define the unnormalized autocorrelation function 



(4.1) 



where expectations are taken in equilibrium. The corresponding normalized autocor- 
relation function is 

PAA{t) = Caa(0/Caa(0) . (4.2) 
We then define the integrated autocorrelation time 



nnt,A = 9 X] PAA{t) 
^ t = -oo 
-j^ oo 

^ t = l 



(4.3) 



[The factor of | is purely a matter of convention; it is inserted so that Tint,A ~ t 
if pAA{t) ^ e"!*!/"^ with r » 1.] The integrated autocorrelation time controls the 
statistical error in Monte Carlo measurements of (A). More precisely, the sample 
mean 

A ^ l-±A, (4.4) 



n 



t=i 



has variance 



yax{A) = 



n 



E CAA{r-s) 

r,s=l 
-I n—l 

n ^ <^ 

t = _(„_!) 

1 



\t\ 

n 



n 



)CAA(t) 

(2Tint,A) Caa{0) forn > t 



(4.5) 
(4.6) 



Thus, the variance of A is a factor 2Tint,A larger than it would be if the {A^} were 
statistically independent. Stated differently, the number of "effectively independent 
samples" in a run of length n is roughly n/2Tint,A- The autocorrelation time Tint,A 
(for interesting observables A) is therefore a "figure of (de)merit" of a Monte Carlo 
algorithm. 

We shall measure static quantities (expectations) and dynamic quantities (auto- 
correlation times) for the following observables: 

= (E^^V (4-7) 



E 







2 




2" 






+ 








X 




X 





E '^x- (^x' 

{xx') 



(4.8) 
(4.9) 
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The mean values of these observables give information on different aspects of the 
2-point function 



G{x) = {(To ■ (T^) 

G{p) = J2e''-^(To-a,) 

X 

In particular, we are interested in the susceptibility 

X = G{0) = V-\M') 
and the analogous quantity at the smallest nonzero momentum 

F = = V-'iJ'). 

By combining these we can obtain the (second-moment) correlation length 

f = ( (x/F)-i y^' 

^ \4sm^{n/L)) 
(see the Remark below). Finally, we have the (negative) energy 

E = 2G'(x)||,|=i = V-\S) . 
Here = is the number of sites in the lattice. 



(4.10a) 
(4.10b) 

(4.11) 
(4.12) 

(4.13) 



Remark. The definition (|4.13| ) is sometimes summarized 
fitting G{p) to the Ansatz 



G{p) 



r' + 4^sin2(p./2) 



i=l 



-1 



(4.14) 
by saying that we are 

(4.15) 



at p = and \p\ = 2tt/L. This is of course true; but it is important to emphasize 
that we are not assuming that G{p) really has the free-field form ( ^.15] ) at general p 
(of course it doesn't). Rather, we simply use this form to motivate one reasonable 
definition of the second-moment correlation length ,^ on a finite lattice. Another 
definition — slightly different but equally reasonable — would be 



X \i=l 



7r)^ sin^(7rxj/L) G{x 



\ 



1/2 



2d 



X 



(4.16) 



The two definitions coincide in the infinite- volume limit L ^ oo. Finally, let us 
emphasize that neither of these quantities is equal to the exponential correlation 
length (= 1/mass gap) 



\x\ 



exp 



lim ■ 

Ixl^oo logG(x) 



(4.17) 
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However, ^ (or and ^exp are believed to scale in the same way as /? — >• oo. 

The integrated autocorrelation time Ti^t^A can be estimated by standard proce- 
dures of statistical time-series analysis |Q These procedures also give statis- 
tically valid error bars on {A) and Tint,A- For more details, see Appendix C]. 
In this paper we have used in all self-consistent truncation window of width 

Q'Tint,A- In setting error bars on ^ we have used the triangle inequality; such error bars 
are overly conservative, but we did not feel it was worth the trouble to measure the 
cross-correlations between M.^ and T. 



4.2 Numerical Results 

In Table ^ we present results using the codimension-1 embedding algorithm (i.e. 
the usual Wolff embedding); the induced Ising model is simulated by hits of the 
standard SW algorithm. Additional data on larger lattices (but only for N^n = 1) can 
be found in . In Tables ^ and ^ we present the analogous results for the codimension- 
2 algorithm; here the induced Ising model is simulated by Nhu hits of the standard SW 
algorithm generalized in the obvious way to a mixed ferromagnetic/antiferromagnetic 
Ising model.Q 

In Table ^ we summarize, for the convenience of the reader, our best estimates of 
the static quantities ^ and E on lattices of size L = 32, 64. These estimates come 
from merging all the data in Tables |T]-^ together with the data from our earlier work 
on the codimension-1 algorithm and from a separate work using the multi-grid 
Monte Carlo algorithm We use these merged data in our finite-size-scaling 

analyses whenever ^ is required. (An analogous table for L = 128, 256 can be found 
inpl.) 

In the codimension-1 algorithm, the induced Ising model is ferromagnetic, and 10 
SW hits are already enough to produce an "almost independent" sample from the 
induced Ising model. As a result, the autocorrelation times for Nhu = 10, 20 are 
equal within error bars, and their average can be taken to represent the idealized 
{Nhit = oo) algorithm, as shown in Table |l[ 

In the codimension-2 algorithm, by contrast, the induced Ising model is highly 
frustrated (see Table |^), and all known algorithms for updating this model are ex- 
tremely inefficient. Both the standard SW algorithm (as generalized to mixed fer- 
romagnetic/antiferromagnetic Ising models) and the single-site Metropolis algorithm 
have very slow convergence to equilibrium, which gets worse as jS and L are increased. 

^^The data points at f3 — 2.20, Nhu = 1 for i = 32, 64 may be unreliable, because the data 
discarded at the beginning of the run may have been insufficient to ensure equilibrium. (Indeed, 
the values for x, C and E seem systematically low.) Unfortunately, the raw data files for these runs 
were lost, so we are unable to reanalyze them with a larger discard interval. We include these data 
points in Tables § and |^ simply to show the order of magnitude of Tint.M^ ^-nd Tint. e- We do not 
include them in the table of merged static data (Table 

^^The (3 — 2.20, N^u = 1 points from Tables || and || are not included in these means. In all other 
cases, X, ^ and E from the different algorithms agree within error bars; this is strong evidence that 
all the programs are correct! 
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Codimension = 1 



L 



N, 



hit 



Sweeps 



X 



E 



1.70 
1.70 
1.70 
1.70 



1 
10 
20 

oo 



100000 
50000 
50000 



27.4 
28.0 
27.4 



0.2 
0.2 
0.2 



3.54 
3.71 
3.58 



0.01 
0.02 
0.02 



0.98438 
0.98400 
0.98399 



0.00037) 
0.00037) 
0.00039) 



4.53 
3.61 
3.76 
3.68 



0.16 
0.15 
0.16 
0.11 



11.39 
5.80 
6.11 
5.96 



0.61 
0.31 
0.33 
0.23 



1.80 
1.80 
1.80 
1.80 



1 
10 
20 

oo 



50000 
50000 
50000 



41.4 

40.9 
41.2 



0.4 

0.3 
0.3 



4.70 

4.61 
4.65 



0.03 

0.03 
0.03 



1.04361 

1.04445 
1.04422 



0.00057) 

0.00039) 
0.00039) 



5.77 

4.04 
4.44 
4.24 



0.31 
0.18 
0.21 
0.14 



13.57 
6.34 
6.76 
6.55 



1.11 
0.36 
0.39 
0.27 



1.90 
1.90 
1.90 
1.90 



1 
10 
20 

oo 



50000 
50000 
50000 



62.8 
61.6 
63.3 



0.6 
0.5 
0.5 



6.03 
5.92 
6.08 



0.04 
0.03 
0.04 



1.10225 
1.10092 
1.10168 



0.00054) 
0.00037) 
0.00037) 



6.80 
4.37 
4.83 
4.60 



0.40 
0.21 
0.23 
0.16 



13.00 
6.35 
6.21 

6.28 



1.05 
0.36 
0.34 
0.25 



2.00 
2.00 
2.00 
2.00 



1 
10 
20 

oo 



50000 
52000 
50000 



94.4 
93.4 
94.0 



0.8 
0.7 
0.7 



7.76 
7.64 
7.71 



0.05 
0.04 
0.04 



1.15555 
1.15518 
1.15491 



0.00053) 
0.00039) 
0.00038) 



7.77 
5.59 
5.53 
5.56 



0.48 
0.29 
0.29 
0.21 



13.46 
7.60 
7.16 
7.38 



1.10 
0.46 
0.42 
0.31 



2.10 
2.10 
2.10 
2.10 



1 
10 
20 

oo 



60000 
50000 
50000 



134.9 
136.3 
134.1 



1.0 
0.8 
0.9 



9.64 
9.78 
9.58 



0.06 
0.05 
0.05 



1.20484 
1.20390 
1.20396 



0.00045) 
0.00035) 
0.00036) 



9.51 
5.92 
6.18 
6.05 



0.60 
0.32 
0.34 
0.23 



13.57 
6.89 
6.90 
6.89 



1.01 
0.40 
0.40 
0.28 



2.20 
2.20 
2.20 
2.20 



1 
10 
20 

oo 



60000 
52000 
50000 



178.2 
177.4 
180.1 



1.1 
0.9 
0.9 



11.59 
11.52 
11.64 



0.07 
0.06 
0.06 



1.24760 
1.24765 
1.24872 



0.00047) 
0.00033) 
0.00035) 



10.61 
6.23 
6.39 
6.31 



0.70 
0.34 
0.36 
0.25 



15.73 
7.02 
7.42 
7.22 



1.27 
0.41 
0.45 
0.30 



2.00 
2.00 
2.00 
2.00 



1 
10 
20 

oo 



60000 
50000 
50000 



98.6 
99.7 
99.4 



0.8 
0.8 
0.8 



7.72 
7.87 
7.91 



0.05 
0.05 
0.05 



1.15384 
1.15400 
1.15353 



0.00023) 
0.00018) 
0.00019) 



4.76 
3.79 
3.94 
3.87 



0.21 
0.17 
0.17 
0.12 



12.39 
6.48 
6.92 
6.70 



0.89 
0.37 
0.40 
0.27 



2.10 
2.10 
2.10 
2.10 



1 
10 
20 

oo 



60000 
50000 
50000 



160.2 
163.1 
159.1 



1.4 
1.4 
1.3 



10.38 
10.61 
10.28 



0.06 
0.06 
0.06 



1.20229 
1.20145 
1.20191 



0.00023) 
0.00018) 
0.00018) 



6.24 
4.76 
4.31 
4.54 



0.32 
0.23 
0.20 
0.15 



13.73 
7.23 
6.90 
7.07 



1.03 
0.43 
0.40 
0.29 



2.20 
2.20 
2.20 
2.20 



1 

10 
20 

oo 



60000 
50000 
50000 



256.4 
258.2 
257.1 



2.2 
1.9 
2.0 



13.70 

13.75 
13.70 



0.08 
0.08 
0.08 



1.24507 
1.24552 
1.24535 



0.00022) 
0.00017) 
0.00017) 



7.49 
4.86 
5.32 
5.09 



0.42 
0.24 
0.27 
0.18 



14.17 
7.24 
6.90 
7.07 



1.08 
0.44 
0.40 
0.30 



2.30 
2.30 
2.30 
2.30 



1 
10 
20 

oo 



60000 
50000 
50000 



393.4 
397.7 
395.1 



2.8 
2.7 
2.6 



17.75 
17.88 
17.82 



0.10 
0.10 
0.10 



1.28437 
1.28444 
1.28444 



0.00021) 
0.00016) 
0.00017) 



8.06 
6.04 
5.74 
5.89 



0.47 
0.33 
0.31 
0.23 



13.75 
7.38 
7.22 
7.30 



1.03 
0.45 
0.43 
0.31 



2.40 
2.40 
2.40 
2.40 



1 
10 
20 

oo 



60000 
50000 
50000 



549.6 
554.5 
556.8 



3.3 
3.0 
3.1 



21.76 
22.04 
22.09 



0.12 
0.12 
0.12 



1.31948 
1.31945 
1.31952 



0.00020) 
0.00016) 
0.00016) 



8.93 
6.54 
6.67 
6.61 



0.54 
0.37 
0.39 
0.27 



13.87 
7.66 
7.57 
7.62 



1.05 
0.47 
0.47 
0.33 



2.50 
2.50 
2.50 
2.50 



1 
10 
20 
oo 



60000 
50000 
50250 



701.1 
708.8 
708.1 



3.4 
3.0 
3.0 



25.54 
25.85 
25.91 



0.13 
0.12 
0.12 



1.35087 
1.35091 
1.35080 



0.00019) 
0.00014) 
0.00015) 



9.56 
6.18 
6.22 
6.20 



0.60 
0.34 
0.34 
0.24 



14.43 
7.11 
7.14 
7.12 



1.11 
0.42 
0.42 
0.30 



Table 1: Results at L = 32 and L = 64 from the co dimension- 1 algorithm. The 
runs were either started from an equilibrium configuration or else had 1000 sweeps 
discarded for equilibration. Rows labelled Nhu — oo are averages of the Nhn — 10, 20 
values. Standard error is shown in parentheses. 
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Codimension 



L 



Nut 



Sweeps 



X 



32 
32 
32 
32 
32 
32 
32 



1.70 
1.70 
1.70 
1.70 
1.70 
1.70 
1.70 



1 

5 

10 
20 
40 
80 
160 



100000 
100000 
50000 
50000 
50000 
80000 
100000 



28.0 
27.8 
28.3 
28.0 
27.6 
27.7 
28.0 



0.2 
0.2 
0.2 
0.2 
0.2 
0.1 
0.1 



3.68 
3.68 
3.75 
3.69 
3.57 
3.65 
3.70 



0.02 
0.01 
0.02 
0.02 
0.02 
0.01 
0.01 



0.98453 
0.98398 
0.98460 
0.98465 
0.98422 
0.98350 
0.98440 



00042 
00028 
00036 
00037 
00037 
00028 
00026 



8.92 
3.57 
2.80 
2.51 
2.30 
2.24 
2.23 



0.42 
0.11 
0.10 
0.09 
0.08 
0.06 
0.05 



14.06 
6.75 
5.63 
5.72 
5.86 
5.45 
5.76 



32 
32 
32 
32 
32 
32 
32 
32 



1.80 
1.80 
1.80 
1.80 
1.80 
1.80 
1.80 
1.80 



1 
5 

10 
20 
40 
80 
160 
320 



200000 
300000 
250000 
250000 
250000 
100000 
80000 
100000 



42.5 

41.1 
41.3 
41.3 
41.4 
41.1 
41.6 
41.5 



0.4 

0.2 
0.2 
0.1 
0.1 
0.2 
0.2 
0.2 



4.83 

4.63 
4.67 
4.68 
4.68 
4.65 
4.71 
4.69 



0.03 

0.01 
0.01 
0.01 
0.01 
0.02 
0.02 
0.02 



1.04459 

1.04460 
1.04479 
1.04447 
1.04477 
1.04444 
1.04429 
1.04440 



00031 

00018 
00018 
00018 
00018 
00028 
00031 
00027 



20.63 
7.33 
5.25 
4.17 
3.69 
3.37 
3.15 
3.19 



1.03 

0.18 
0.12 
0.09 
0.07 
0.10 
0.10 
0.09 



16.62 
8.06 
7.25 
6.72 
6.95 
6.76 
6.61 
6.38 



32 
32 
32 
32 
32 
32 
32 
32 
32 



1.90 
1.90 
1.90 
1.90 
1.90 
1.90 
1.90 
1.90 
1.90 



1 

5 

10 
20 
40 
80 
160 
320 
640 



200000 
300000 
300000 
250000 
250000 
100000 
100000 
100000 
50000 



61.7 
62.0 
62.5 
62.2 
61.8 
62.0 
62.3 
61.9 
61.7 



0.7 
0.4 
0.3 
0.3 
0.3 
0.4 
0.4 
0.4 
0.5 



5.93 
5.94 
6.00 
5.98 
5.93 
5.97 
5.96 
5.95 
5.92 



0.05 
0.02 
0.02 
0.02 
0.02 
0.03 
0.03 
0.02 
0.03 



1.10152 
1.10167 
1.10175 
1.10143 
1.10167 
1.10147 
1.10212 
1.10156 
1.10120 



00033 
00019 
00018 
00019 
00019 
00030 
00029 
00029 
00040 



43.76 
14.80 
10.79 
8.58 
6.95 
6.24 
5.54 
5.39 
4.95 



3.18 
0.51 
0.32 
0.25 
0.18 
0.25 
0.21 
0.20 
0.24 



20.17 
9.74 
8.65 
7.97 
7.98 
8.30 
7.69 
7.80 
7.33 



32 
32 
32 
32 
32 
32 
32 
32 
32 



2.00 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 



1 

5 

10 
20 
40 
80 
160 
320 
640 



200000 
300000 
300000 
250000 
250000 
100000 
100000 
100000 
100000 



93.2 
93.8 
92.7 
94.3 
93.4 
94.8 
94.5 
93.4 
94.5 



1.4 
0.7 
0.6 
0.5 
0.5 
0.7 
0.7 
0.7 
0.6 



7.63 
7.67 
7.61 
7.74 
7.64 
7.76 
7.73 
7.64 
7.76 



0.09 
0.04 
0.03 
0.03 
0.03 
0.04 
0.04 
0.04 
0.04^ 



1.15507 
1.15510 
1.15493 
1.15522 
1.15517 
1.15542 
1.15532 
1.15552 
1.15494 



00037 
00020 
00018 
00019 
00020 
00030 
00031 
00032 
00032 



92.52 
29.77 
22.17 
16.42 
12.52 
10.85 
10.16 
9.64 
9.24 



9.76 
1.46 
0.94 
0.66 
0.44 
0.56 
0.50 
0.47 
0.44 



26.83 
12.41 
10.20 
9.58 
9.45 
9.17 
9.64 
9.97 
9.73 



32 
32 
32 
32 
32 
32 
32 
32 



2.10 
2.10 
2.10 
2.10 
2.10 
2.10 
2.10 
2.10 



5 
10 
20 
40 
80 
160 
320 
640 



105000 
300000 
300000 
300000 
400000 
400000 
105000 
100000 



137.4 
134.5 
134.5 
134.4 
134.3 
134.8 
135.4 
134.1 



1.6 
0.8 
0.7 
0.7 
0.5 
0.4 
0.9 
0.9 



9.90 
9.60 
9.61 
9.62 
9.60 
9.66 
9.70 
9.59 



0.10 
0.10 
0.04 
0.04 
0.03 
0.03 
0.10 
0.05 



1.20400 
1.20388 
1.20391 
1.20394 
1.20391 
1.20386 
1.20409 
1.20390 



00034 
00020 
00019 
00018 
00016 
00016 
00031 
00031 



45.50 
35.91 
26.97 
21.30 
17.66 
14.88 
14.76 
13.16 



4.68 
1.93 
1.26 
0.88 
0.58 
0.45 
0.87 
0.74 



13.66 
12.78 
11.74 
10.76 
10.80 
10.77 
11.17 
10.28 



32 
32 
32 
32 
32 
32 
32 
32 



2.20 
2.20 
2.20 
2.20 
2.20 
2.20 
2.20 
2.20 



1 
5 
10 
20 
40 
80 
160 
320 



250000 
301000 
300000 
300000 
100000 
100000 
100100 
100000 



172.4 
180.0 
178.7 
179.7 
179.7 
179.0 
178.8 
179.7 



2.2 
1.1 
1.0 
0.9 
1.3 
1.2 
1.2 
1.1 



11.13 
11.68 
11.59 
11.68 
11.64 
11.59 
11.59 
11.66 



0.14 
0.07 
0.06 
0.06 
0.08 
0.08 
0.08 
0.07 



1.24682 
1.24783 
1.24791 
1.24783 
1.24786 
1.24791 
1.24801 
1.24805 



00033 
00020 
00019 
00018 
00032 
00031 
00031 
00031 



177.34 
55.67 
44.06 
34.04 
28.45 
23.63 
21.67 
19.23 



23.39 
3.72 
2.62 
1.78 
2.36 
1.78 
1.56 
1.31 



34.40 
15.41 
13.19 
12.42 
12.74 
11.38 
12.14 
11.61 



Table 2: Results at L = 32 from the codimension-2 algorithm. All runs were started 
in equilibrium, except for j3 = 2.20, N^it = 1, where 5000 sweeps (probably not 
enough) were discarded for equilibration. Standard error is shown in parentheses. 
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Codimension = 2 



/3 



Sweeps 



X 



E 



Tint,M^ 



'^int, £ 



64 

64 
64 
64 
64 
64 
64 
64 



2.00 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 



1 
5 
10 
20 
40 
80 
160 
320 



200000 
300500 
250000 
200000 
100200 
100000 
100000 
100000 



99.7 

98.8 
100.3 

98.6 
100.1 
100.3 

98.9 

99.1 



1.9 

0.9 
0.8 
0.8 
1.0 
0.9 
0.8 
0.7 



7.86 
7.79 
7.93 
7.80 
7.88 
7.97 
7.75 
7.80 



0.10 

0.05 
0.05 
0.04 
0.06 
0.05 
0.04 
0.04 



1.15404 
1.15368 
1.15382 
1.15368 
1.15392 
1.15371 
1.15374 
1.15381 



00017 
00010 
00010 
00011 
00015 
00015 
00014 
00014 



89.48 
31.71 
21.06 
15.99 
12.32 
9.59 
7.43 
6.53 



9.28 
1.60 
0.95 
0.70 
0.67 
0.46 
0.32 
0.26 



24.59 
11.73 
9.98 
9.56 
9.19 
9.06 
8.40 
8.61 



64 
64 
64 
64 
64 
64 
64 
64 
64 



2.10 
2.10 
2.10 
2.10 
2.10 
2.10 
2.10 
2.10 
2.10 



1 
5 
10 
20 
40 
80 
160 
320 
640 



50000 
950000 
700000 
750000 
400000 
400000 
100000 
100250 

25000 



152.8 
159.9 
158.9 
159.7 
161.7 
161.1 
159.0 
162.2 
162.5 



9.9 
1.3 
1.2 
1.0 
1.2 
1.1 
2.0 
1.8 
3.3 



10.00 
10.36 
10.26 
10.35 
10.51 
10.47 
10.29 
10.50 
10.48 



0.40 
0.05 
0.05 
0.04 
0.05 
0.05 
0.09 
0.10 
0.15 



1.20202 
1.20184 
1.20180 
1.20189 
1.20181 
1.20179 
1.20197 
1.20184 
1.20178 



00036 
00006 
00006 
00005 
00008 
00008 
00015 
00015 
00028 



292.77 
81.32 
56.19 
40.65 
29.75 
24.17 
20.22 
16.39 
13.96 



110.90 
3.70 
2.47 
1.48 
1.26 
0.92 
1.42 
1.04 
1.62 



30.05 
13.58 
12.06 
11.16 
10.37 
10.19 
10.26 
10.55 
9.06 



64 
64 
64 
64 
64 
64 
64 
64 
64 



2.20 
2.20 
2.20 
2.20 
2.20 
2.20 
2.20 
2.20 
2.20 



1 
5 
10 
20 
40 
80 
160 
320 
640 



200000 
1000000 
1000000 
800000 
600000 
400100 
400000 
200199 
102840 



244.3 
256.5 
258.1 
261.9 
257.0 
260.0 
255.6 
254.3 
251.1 



10.3 
2.5 
2.0 
2.0 
1.9 
2.2 
2.0 
2.5 
3.2 



12.88 
13.69 
13.75 
14.01 
13.71 
13.94 
13.65 
13.54 
13.36 



0.39 
0.10 
0.08 
0.08 
0.08 
0.08 
0.08 
0.10 
0.12 



1.24504 
1.24521 
1.24532 
1.24540 
1.24534 
1.24541 
1.24536 
1.24527 
1.24524 



00020 
00006 
00005 
00006 
00006 
00008 
00008 
00011 
00015 



540.92 
165.44 
109.43 
84.23 
60.90 
50.95 
45.13 
35.28 
29.52 



141.41 
10.43 
5.61 
4.24 
3.01 
2.82 
2.35 
2.30 
2.46 



36.60 
15.61 
13.98 
12.99 
12.15 
11.79 
11.50 
11.26 
11.74 



Table 3: Results at L = 64 from the codimension-2 algorithm. All runs were started 
in equilibrium, except for [3 = 2.20, N^it = 1, where 10000 sweeps (probably not 
enough) were discarded for equilibration. Standard error is shown in parentheses. 
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Merged 0(4)-Model Static Data 



X 



E 



32 
32 
32 
32 
32 
32 
32 
32 
32 



1.62 
1.70 
1.80 
1.90 
2.00 
2.10 
2.20 
2.30 
2.40 



21.0 
27.8 
41.3 
62.1 
93.7 
134.6 
178.8 
221.1 
258.5 



3.09 
3.65 
4.67 
5.96 
7.68 
9.63 
11.60 
13.42 
14.94 



0.02) 
0.01) 
0.01) 
0.01) 
0.01) 
0.01) 
0.02) 
0.02) 
0.03) 



0.93417 (0, 
0.98418 (0, 
1.04456 (0, 
1.10161 (0, 
1.15509 (0, 
1.20398 (0, 
1.24787 (0, 
1.28685 (0, 
1.32155 (0, 



00054) 
00011) 
00007) 
00007) 
00004) 
00004) 
00004) 
00004) 
00007) 



64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 
64 



1.70 
1.80 
1.90 
2.00 
2.10 
2.15 
2.20 
2.22 
2.24 
2.26 
2.28 
2.30 
2.35 
2.40 
2.43 
2.45 
2.50 
2.55 
2.60 



27.9 
41.2 
64.1 
99.4 
160.5 
202.9 
257.9 
283.3 
309.5 
337.4 
364.7 
395.0 
471.5 
551.6 
601.5 
632.5 
706.7 
779.0 
847.9 



3.65 
4.60 
6.10 
7.83 
10.40 
11.93 
13.77 
14.58 
15.38 
16.20 
16.93 
17.81 
19.87 
21.92 
23.22 
23.97 
25.77 
27.49 
29.06 



0.02) 
0.10) 
0.10) 
0.01) 
0.01) 
0.03) 
0.02) 
0.04) 
0.04) 
0.04) 
0.04) 
0.03) 
0.05) 
0.03) 
0.05) 
0.05) 
0.04) 
0.05) 
0.04) 



1.15378 
1.20184 
1.22410 
1.24532 
1.25349 
1.26149 
1.26932 
1.27692 
1.28442 
1.30244 
1.31950 
1.32931 
1.33563 
1.35092 
1.36545 
1.37928 



(0.00004) 
(0.00002) 
(0.00003) 
(0.00001) 
(0.00003) 
(0.00003) 
(0.00003) 
(0.00003) 
(0.00001) 
(0.00003) 
(0.00001) 
(0.00002) 
(0.00002) 
(0.00001) 
(0.00002) 
(0.00002) 



Table 4: Best estimates of susceptibility, correlation length and energy for the 0(4) 
model on 32 x 32 and 64 x 64 lattices, from Tables and [|, Standard error is 
shown in parentheses. 
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Co dimension 


= 2 


T 

1j 


P 


Frustration 


32 


1.70 


0.2571 


(0.0001) 


32 


1.80 


0.2452 


(0.0001) 


oZ 


1.90 


0.2329 


(0.0001) 


32 


2.00 


0.2205 


(0.0001) 


32 


2.10 


0.2086 


(0.0001) 


32 


2.20 


0.1973 


(0.0001) 


64 


2.00 


0.2209 


(0.0001) 


64 


2.10 


0.2092 


(0.0001) 


64 


2.20 


0.1980 


(0.0001) 



Table 5: Average fraction of frustrated plaquettes in the induced Ising model arising 
from the codimension-2 algorithm. Standard error is shown in parentheses. 



A combination of SW and Metropolis appears to be no better (in rate of convergence 
per unit CPU time) than SW alone. Therefore, we had no choice but to simulate the 
induced Ising model with the best algorithm available to us — namely, standard SW 
— and to use an enormous number of hits in an effort to approximate the behavior 
of the idealized embedding algorithm. 

(After our runs were completed, Kandel, Ben-Av and Domany |5^ reported very 



encouraging results for simulating a frustrated Ising model using an ingenious new 
algorithm of Swendsen-Wang type. Their algorithm is apparently successful in cases 
of full frustration, but not in cases of partial frustration. In the codimension-2 0(4) 
application at the (3 and L values studied here, the induced Ising model has approx- 
imately 20-25% of the plaquettes frustrated. We therefore do not expect the Kandel 
et al. algorithm to work miracles, but even a factor-of-2 speedup over standard SW 
would be highly desirable. We have not yet had an opportunity to try the Kandel et 
al. algorithm in our induced Ising model, but we hope to do so in the near future.) 

From Tables ^ and ^ we see that even at N^n = 640, the autocorrelation time 
Tint,M'^ for the codimension-2 algorithm has not stabilized (except for L = 32 and 
f3 ^ 1.8). Therefore, we are obliged to attempt an extrapolation of our data to 
Nhit = oo. This turns out to be a very tricky business, as the behavior of Tint^M^ a 
function of is extremely complicated (and we have no theoretical understanding 
of it). We tried empirical fits of the form 

rint,M^iNhit) = a+T^ (4-18) 

where a = Tint,M^{^hit = oo) and b are variable, and A is some fixed exponent. (We 
also tried fits with A variable, but these fits were quite unstable, and the error bars 
on A were large.) Reasonable fits can be obtained provided that one discards the 
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P\A\ OA (15 (16 oTt oI 09 To O TT 



1.70 

\3 DFl 


1.93(0.08) 2.01(0.07) 2.07(0.06) 2.11(0.05) 2.14(0.05) 2.16(0.04) 2.18(0.04) 2.19(0.04) 2.20(0.04) 
2.71 2.05 1.50 1.06 0.73 0.50 0.36 0.32 0.34 


1.80 

f4 DFl 


2.32(0.10) 2.57(0.08) 2.74(0.07) 2.87(0.06) 2.96(0.06) 3.04(0.06) 3.10(0.05) 3.16(0.05) 3.20(0.05) 
19.17 12.80 7.86 4.34 2.16 1.20 1.31 2.30 4.03 


1.90 

[5 DF] 


3.45(0.19) 4.04(0.16) 4.44(0.14) 4.73(0.13) 4.95(0.12) 5.12(0.12) 5.26(0.11) 5.38(0.11) 5.48(0.10) 
11.00 5.48 2.25 1.22 2.16 4.81 8.86 14.01 19.94 


2.00 

[5 DF] 


5.91(0.43) 6.98(0.36) 7.70(0.32) 8.22(0.29) 8.62(0.27) 8.94(0.26) 9.20(0.25) 9.42(0.24) 9.60(0.24) 
22.16 14.70 8.78 4.51 1.87 0.77 1.04 2.50 4.93 


2.10 

[5 DF] 


7.10(0.70) 9.23(0.57) 10.69(0.49) 11.77(0.43) 12.59(0.39) 13.23(0.36) 13.75(0.34) 14.17(0.33) 14.51(0.32) 
11.16 5.92 2.88 1.94 2.88 5.42 9.25 14.08 19.60 


2.20 

[4 DF] 


10.99(1.60) 13.96(1.34) 15.92(1.18) 17.31(1.08) 18.33(1.01) 19.12(0.96) 19.74(0.92) 20.24(0.89) 20.66(0.87) 
1.10 0.43 0.20 0.38 0.93 1.82 2.99 4.39 5.98 




2.00 

[4 DF] 


1.52(0.40) 3.23(0.32) 4.36(0.27) 5.15(0.24) 5.73(0.22) 6.16(0.21) 6.50(0.20) 6.76(0.20) 6.97(0.19) 
2.80 1.09 1.65 4.32 8.85 14.92 22.21 30.40 39.19 


2.10 

[5 DF] 


3.67(1.16) 7.98(0.97) 10.90(0.85) 13.03(0.76) 14.65(0.70) 15.93(0.66) 16.97(0.63) 17.82(0.61) 18.53(0.59) 
6.03 2.11 0.99 2.48 6.28 12.01 19.30 27.76 37.05 


2.20 
[5 DF] 


12.01(2.34) 19.84(1.94) 25.09(1.70) 28.85(1.54) 31.66(1.44) 33.84(1.36) 35.56(1.30) 36.95(1.26) 38.08(1.23) 
4.43 3.48 4.58 7.53 12.07 17.88 24.63 32.03 39.79 



Table 6: Extrapolated values of Tint, M'^i.^ hit = cxd) for L = 32 (upper half of table) 
and L = 64 (lower half), based on a least-squares fit to ( |4.18| ) using the data for 



Nhit > 10; error bars are one standard deviation, statistical error only. The second 
line is the value of x^'i number of degrees of freedom is indicated in the first column. 
Our subjective selection of the "best" fit is marked in boldface. 



data from the lowest values of Nhn (namely, Nhn = 1, 5); but the optimal exponent A 
is not well determined, and it moreover shows a clear (and more-or-less but not quite 
monotonic) variation with f3 and L. Of course, we have no explanation whatsoever for 
these empirical observations. (That is, we have no theoretical understanding of the 
dynamic behavior of the SW algorithm on a highly frustrated Ising model.) Therefore, 
the best we can do is to report the results of our extrapolations — allowing for a wide 
range of values of A — and let the reader judge their reasonableness. In Figures 
0, ^ and ^ we plot Tint,M^ versus l/N^^^ for a few selected values of A. In Table ^ 
we show the extrapolated values of Tint, M'^i.^ hit = cxd) as a function of the exponent 
A, based on a least-squares fit to the data with Nhit > 10. Our subjective selection 
of the "best" fit is marked in boldface. The A values in a range ±0.1 around this 
"best" value yield also reasonable fits; we take these three extrapolants as a 68% 
subjective confidence interval on Tint, M'^i.^ hit = oo). Clearly there is a very wide 
range of "reasonable" extrapolants, especially for L = 64 at the higher values of (3. 
Our final results will therefore have a very large systematic uncertainty. 

For the autocorrelation time Tint,e, the extrapolation to N^it = oo is fortunately 
less problematic: in most cases Tint,s is constant within error bars for N^n ^ 40, or 
at worst it shows in this region a very slow decrease as a function of Nhn- In Table 
we show the extrapolated values of Tint,e{Nhit = oo) as a function of the exponent 
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Figure 2: Tint,M^ versus l/A^^^ with A = 0.6, for L = 32 and /3 = 1.7 (+), 1.8 (x), 
1.9 (O), 2.0 (□), 2.1 (O), 2.2 (o). 
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Figure 3: Tint,M^ versus l/A^^^ with A = 0.8, for L = 32 and = 1.7 (+), 1.8 (x), 
1.9 (O), 2.0 (□), 2.1 (O), 2.2 (o). 



35 




Figure 4: Tint,M^ versus with A = 0.6, for L = 64 and /3 = 2.0 (□), 2.1 (Q) 

2.2 (o). 



36 



3 \ A 


04 5 6 7 8 9 1 1 1 1 2 


1.70 

[3 DF] 


5.65(0.30) 5.65(0.25) 5.65(0.22) 5.65(0.20) 5.66(0.18) 5.66(0.17) 5.66(0.16) 5.66(0.16) 5.66(0.15) 
1.55 1.55 1.55 1.55 1.55 1.56 1.56 1.56 1.56 


1.80 

[4 DF] 


6.28(0.24) 6.37(0.20) 6.44(0.18) 6.49(0.16) 6.52(0.15) 6.55(0.14) 6.57(0.14) 6.59(0.13) 6.61(0.13) 
3.21 3.23 3.25 3.29 3.32 3.36 3.40 3.43 3.47 


1.90 

[5 DF] 


7.36(0.27) 7.48(0.23) 7.55(0.21) 7.61(0.19) 7.65(0.18) 7.68(0.17) 7.71(0.16) 7.73(0.16) 7.75(0.15) 
3.76 3.65 3.56 3.49 3.43 3.39 3.35 3.33 3.31 


2.00 

[5 DF] 


9.33(0.34) 9.37(0.29) 9.39(0.26) 9.41(0.24) 9.42(0.23) 9.43(0.22) 9.44(0.21) 9.45(0.20) 9.46(0.19) 
4.31 4.07 3.83 3.60 3.39 3.19 3.01 2.85 2.70 


2.10 

[5 DF] 


9.77(0.32) 10.02(0.27) 10.19(0.24) 10.31(0.22) 10.40(0.20) 10.47(0.19) 10.52(0.18) 10.57(0.17) 10.61(0.17) 
5.59 4.79 4.12 3.57 3.13 2.79 2.56 2.40 2.32 


2.20 

[4 DF] 


11.06(0.57) 11.28(0.49) 11.42(0.44) 11.52(0.40) 11.60(0.38) 11.66(0.36) 11.71(0.34) 11.75(0.33) 11.78(0.32) 
2.06 1.98 1.93 1.90 1.89 1.90 1.92 1.96 2.00 




2.00 

[4 DF] 


8.00(0.37) 8.22(0.31) 8.36(0.28) 8.46(0.26) 8.54(0.24) 8.60(0.23) 8.65(0.22) 8.69(0.22) 8.72(0.21) 
0.77 0.84 0.97 1.13 1.32 1.53 1.76 2.00 2.24 


2.10 

[5 DF] 


8.96(0.34) 9.29(0.29) 9.51(0.25) 9.67(0.23) 9.80(0.21) 9.90(0.20) 9.98(0.19) 10.04(0.18) 10.10(0.17) 
4.81 4.06 3.52 3.15 2.94 2.86 2.89 3.02 3.23 


2.20 
[5 DF] 


10.34(0.30) 10.70(0.25) 10.95(0.22) 11.13(0.21) 11.26(0.19) 11.37(0.18) 11.45(0.17) 11.52(0.17) 11.58(0.16) 
2.31 1.49 1.01 0.84 0.92 1.20 1.65 2.22 2.88 



Table 7: Extrapolated values of Tint^eiNhu = oo) for L = 32 (upper half of table) 
and L = 64 (lower half), based on a least-squares fit to ( |4.18| ) using the data for 



Nhit > 10; error bars are one standard deviation, statistical error only. The second 
line is the value of x^] number of degrees of freedom is indicated in the first column. 
Our subjective selection of the "best" fit is marked in boldface. 



A, based on a least-squares fit to the data with Nhu > 10. Here the extrapolation 
is in most cases insensitive to the choice of A, and there is no discernible systematic 
preference for one or another value of A. Therefore we have chosen (rather arbitrarily) 
A = 1 as the "best" extrapolant, and the range 0.8 < A < 1.2 as defining a 68% 
subjective confidence interval on Tint,e{Nhit = cc). 

We can now make a finite-size-scaling analysis of the dynamic critical behavior, 
using the Ansatz 

nntAP,L) ~ af3,Lr"''^gA{a(^.L)lL) (4.19) 

for A = Jli^,£. Here qa is an unknown scaling function, and (7^(0) = lim^iQ gA{x) is 
supposed to be finite and nonzero.0 We emphasize that the dynamic critical exponent 
Zint,A is in general different from the exponent Zg^p associated with the exponential 
autocorrelation time Texp |0, p9[] . 

It is of course equivalent to use the Ansatz 

and indeed the two Ansatze are related by hA{x) = x^"'*''*.g^(x). However, to determine whether 
\hnxio gA{x) — lhnxiox~^^"*-^hA{x) is nonzero, it is more convenient to inspect a graph of gA than 
one of hA- 
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For the co dimension- 1 idealized algorithm {Nhu = c>o), the dynamic critical ex- 
ponents are very close to zero: we estimate Zint,e = 0.1 ± 0.1 and Zint^M"^ = 0.0 ±0.1 
(subjective 68% confidence intervals). Finite-size-scaling plots of Tj^f^yi^"^™'--^ versus 
^/L are shown in Figures ^ and These estimates are of course in agreement with 
our previous results showing that even for Nhu = 1 we have z very close to zero. 

For the codimension-2 idealized algorithm, a similar analysis using the extrapo- 
lated data from Tables ^ and |^ yields the estimates Zint,M^ = 1-5 ± 0.5 and Zint,e = 
0.5 ± 0.2 (subjective 68% confidence intervals). The finite-size-scaling plots corre- 
sponding to these values of z are shown in Figures |^ and |[ Clearly the estimate for 
Zint,M^ is very imprecise, as a result of the large systematic error bars in the extrapo- 
lation of Tint,M^ fo'^ L = 64. All we can say with much certainty is that Zint,M'^ is far 
from zero; it is consistent with our data, but not at all guaranteed, that Zint^M'^ ~ 2. 

It is interesting to note the relative sizes of Tint,M'^ as compared to Tint^s (and also 
of ZintM"^ compared to Zint,e)- For the traditional local algorithms (e.g. single-site 
heat-bath), M.^ is a much slower mode than because it is much more strongly 
coupled to the long-wavelength spin waves that evolve slowly in the local dynamics.0 
For the codimension-1 idealized Wolff algorithm, by contrast, the two observables 
have roughly equal autocorrelation times (and in fact £^ is a little slower): the Wolff 
collective moves are apparently equally effective at equilibrating fluctuations on all 
length scales. For the codimension-2 idealized algorithm, however, M.^ is again much 
slower than £: this conflrms our view that the codimension-2 reflection is ineffective 
at equilibrating long-wavelength spin waves, and that its effect is "primarily local" . 

We wish to emphasize once again the importance of studying the idealized em- 
bedding algorithm, and the misleading conclusions that can be caused by the failure 
to do so. Indeed, suppose that we had studied the codimension-2 algorithm only for 
Nhit = 1- Then, by the usual dynamic flnite-size-scaling analysis, we would have con- 
cluded that Zint^M'^ ~ 3 and Zint,e ~ 0.75 {very roughly), i.e. that the codimension-2 
embedding algorithm is disastrously bad. Now, this conclusion happens to be true, 
but not for the reason claimed! In fact, the enormous autocorrelation times Tj„t ^2 in 
the Nhit = 1 algorithm reflect primarily the inability of the SW algorithm to update 
efficiently the highly frustrated induced Ising model (as one can verify by comparing 
to larger values of Nhu), and not any intrinsic defect of the codimension-2 embedding. 
The intrinsic properties of the embedding are found only by considering Nhu — ^ 00. 

The CPU time for this program is approximately 5.35 x NhuL"^ /is/sweep on a 
Cray Y-MP 8/432. Thus, the total CPU time for the the runs reported here is 
approximately 4000 Cray hours.Q 



See e.g. ||T^ for some quantitative measurements. 

^'^ This is only an "equivalent" figure, as the runs were actually performed on a variety of super- 
computers and RISC workstations: see the Acknowledgments. 
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Figure 5: Dynamic finite-size-scaling plot of Tint,£^ ^™*"^ versus ^/L for Zint,£ = 0.1, 
for the idealized {Nhn — oo) codimension-1 algorithm on L = 32 (□) and L = 64 (x). 
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Figure 6: Dynamic finite-size-scaling plot of Tint, m^^ ^mm^ versus ^/L for Zint,M^ ~ 
0, for the idealized {Nhu = oo) codimension-1 algorithm on L = 32 (□) and L = 64 

(X). 
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Figure 7: Dynamic finite-size-scaling plot of Tint^M^^"^""""^^ versus ^/L for Zint^M^ = 
1.5, for the idealized {Nhu = oo) codimension-2 algorithm. Data points are from the 
extrapolations in Table ^with L = 32 (□) and L = 64 (x). 
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Figure 8: Dynamic finite-size-scaling plot of Tint,si~^""''^ versus ^/L for Zint,s = 0.5, 
for the idealized {N^u = oo) codimension-2 algorithm. Data points are from the 
extrapolations in Table ^with L = 32 (□) and L = 64 (x). 
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5 Discussion 



Let us now place our analysis of Wolff-type embedding algorithms into the more gen- 
eral context of arbitrary embedding algorithms. The general idea is to "foliate" 



the configuration space of the original model into "leaves" isomorphic to the config- 
uration space of some "embedded" model. One then moves around the current leaf, 
using any legitimate Monte Carlo algorithm for simulating the conditional probabil- 
ity distribution restricted to that leaf (i.e. the induced Hamiltonian for the embedded 
model). Of course, one must combine this move with other moves, or with a different 
foliation, in order to make the algorithm ergodic. (This same structure arises also in 



multi-grid Monte Carlo, where it is termed "partial resampling" ||5^, |T^, [T^.) 

As noted in the Introduction, the performance of an embedding algorithm is de- 
termined by the combined effect of two completely distinct issues: 

i) How well the embedding captures the important large-scale collective modes of 
the original model. The point is that these modes must be realizable by motions 
within a leaf. 

ii) How well some particular algorithm (e.g. standard SW or single-cluster SW) 
succeeds in updating the embedded model. 

It is crucial to employ a test procedure that disentangles these two issues, if one wants 
to obtain physical insight into why a particular embedding algorithm does or does 
not work well. 

In all of the recently-invented embedding algorithms — Wolff-type algorithms for 
nonlinear a- models and Nt = 1 SU{2) gauge theories [0], spin-flip algorithms for 



one-component scalar-field models |611], and reflection algorithms for solid-on-solid 
(SOS) and anharmonic-crystal models |^ — the embedded model happens to be an 



Ising model. But the principle is much more general: for example, one might consider 
embeddings of U{1) spins in a higher a-model, U{1) or spins in an SU{N) gauge 
theory, etc. Of course, the nontrivial problem is to find an embedding that captures 
at least some of the important collective modes of the original model, and once this 
has been done, to find an efficient algorithm for updating the embedded model. 

In Section ^ we explained why an idealized embedding algorithm will do a good 
job of handling long- wavelength spin waves if, in the induced Hamiltonian for the 
embedded model, x-space is divided into large disconnected regions which are almost 
decoupled from each other. We furthermore conjectured that in a Wolff-type embed- 
ding of Ising spins into a nonlinear cr-model, this is the only mechanism by which 
long-wavelength spin waves can be handled well; and our numerical results in Section 
^ gave modest support for this conjecture in at least one case. However, we wish to 
point out that whatever the status of this conjecture for embeddings of Ising spins, 
it certainly cannot be true as a general proposition about embedding algorithms. To 
see this, consider the extreme case in which there is only one leaf (namely, the whole 
configuration space): then the idealized embedding algorithm performs perfectly (it 
is just independent sampling from the Gibbs measure of the original model), but the 
induced Hamiltonian for the embedded model does not decouple anywhere (it is just 
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the original Hamiltonian) . This rather trivial extreme example shows that decoupling 
cannot be the only mechanism by which an idealized embedding algorithm can work 
well. More generally, consider any two foliations J^i and JF2 of the same configuration 
space, such that J^i is a refinement of JF2 (i.e. the leaves of JF2 are unions of leaves 
from J-'i). It is intuitively clear that the idealized embedding algorithm based on the 
foliation JF2 (plus possibly other moves) will perform better than the one based on 
foliation J-'i (plus the same other moves): in JF2 there is more freedom to move around 
within a leaf. On the other hand, the induced Hamiltonian based on JF2 will have 
fewer (if any) surfaces in x-space along which it decouples, compared to the induced 
Hamiltonian based on J-'i: the larger the leaves, the more stringent the requirement 
of a complete decoupling. So "performance" and "decoupling" have opposite mono- 
tonicities as a function of the "size" of the leaves. It follows that decoupling cannot 
be the general principle that explains the good or bad performance of embedding 
algorithms. 

In particular, it is not justified to insist on the codimension-1 property when 
considering non-Ising embeddings — and this is fortunate, since by Theorem |A.9| (see 



also the remarks following it), only Ising embeddings can have the codimension-1 
property! 

There are many possibilities for embedding algorithms: each such algorithm can 
be interpreted as "reducing" one simulation problem to another (hopefully simpler) 
one. For example, consider a (possibly frustrated) SU{N) principal chiral model 
defined by the Hamiltonian 

H{{U}) = -^RetriA^yUlUy), (5.1) 
{xy) 

where the A^y are N x N complex matrices. Then one can embed U{1) (= XY) 
spins as follows: let T = diag(i, —i, 0, . . . , 0) G su{N), let i? be a random element of 
SU{N), and embed the field {6x} of XY spins according to the rule 

t/;""" = i?exp(^^T)i?-^f/°''^ . (5.2) 

It is easy to see that the induced XY Hamiltonian is of the form 

H{{0}) = - E [»-y cos(^x - dy) + P^y - ^y)] , (5-3) 

i^y) 

i.e. it is a nearest-neighbor XY model, with couplings that are in general frustrated 
(even if the original SU{N) model is ferromagnetic). We conjecture that the idealized 
embedding algorithm corresponding to this embedding has dynamic critical exponent 
2; ~ 0, at least if the SU (N) model is ferromagnetic: the idea is that the spin waves 
in SU{N), which are approximately Gaussian, can be obtained by superposing spin 
waves in the various U{1) subgroups generated by the RTR~^. If this is the case, 
then the problem of simulating an SU{N) principal chiral model (possibly even a 
frustrated one) has been reduced to the problem of simulating a frustrated XY model. 
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Unfortunately, we liave no idea liow to carry out efficiently the latter simulation.^ 
But if, some day in the future, the latter problem should be solved, then it is useful 
to know that the former problem would also be solved. 

It is also interesting to note that the Wolff and multi-grid algorithms can be 
understood from a unified perspective. In both cases one exploits an exact symmetry 
of the model (global reflection in the case of Wolff, global rotation in the case of 
multi-grid) and applies it in an inhomogeneous way (constant on clusters in the case 
of Wolff, constant on cubical blocks in the case of multi-grid). In both cases the energy 
cost is a surface term. This perspective may be useful in suggesting generalizations of 
the Wolff and/or multi-grid algorithms to broader classes of models, especially lattice 
gauge theories. 

For example, consider a f/(l) gauge theory with Hamiltonian 

H = -Y^PpcosIy^oA , (5.4) 
P yeP J 

where 9i G [0,27r] is a gauge potential on the oriented link i, J^e^p^e is a properly 
oriented sum over the links bounding the plaquette P, and /?p > for all P. (Usually 
all j3p will be equal.) Next let {9^} be an arbitrary curvature- free gauge field, i.e. a 
ground state for the Hamiltonian H.f^ Now it is easy to see that a global reflection 
of {Oi} around {6'^} is a symmetry of H. Let us therefore consider applying this 
reflection in an inhomogeneous way, i.e. let us consider the embedding of Ising 
spins {si} defined by 

9e QT" = ei + e,{e,-ei) . (5.5) 

Under this updating, the energy changes only on those plaquettes P for which the Si 
{i G P) are not all equal; heuristically, the energy cost is a "surface term". Therefore, 
we conjecture that the idealized embedding algorithm based on ( ^3| ) — with, say, a 
random choice of {6^} — has dynamic critical exponent z ^ 0. Now, the induced 
Ising Hamiltonian corresponding to ( ^.5|) is of the form 



H{{££}) = -J2f^P [Ap^ii^e^^es^ii + {Bp/^t^ei^ea^ + 5 similar terms)] + const 

(5.6) 



The multi-grid method is apphcable to the latter problem, and its performance might 

not be totally disastrous. But the multi-grid method is also applicable to the original SU{N) model 
^ , and it is easy to see that multi-grid updates on the induced XY model are simply a subset 
of the multi-grid updates on the SU{N) model. Thus, at least for the idealized two-grid cycle 
and presumably also for the other cycles, nothing is gained by first embedding U{1) variables. An 
embedding is useful only if there exists an efficient algorithm for simulating the embedded model 
that for some reason does not generalize to the original model. 

In free boundary conditions, such a field {9'^} would be a gauge transform of the identity. 
However, in periodic boundary conditions (i.e. on the torus T"^) there are d additional linearly 
independent possibilities for {6*^}, which give arbitrary values to the d independent Polyakov loops. 
[In fancy language, these solutions are representatives of a basis for the first cohomology group 
i/i(T'';M/Z) ~ (M/Z)''.] 
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where P = {^1,^2,^3,^4} and 



Ap = sm{ee,-9l)sm{ee,-9l)sm{ee,~ei)sm{e,,-9l) (5.7a) 
5p,,,,, = - sin(^^,, - 61) siniOe, - 61) cos{e,, - 91) cos(0,, - 9^) (5.7b) 
etc. 

This is not a Z2 gauge model; rather, it is a Z2 spin model with a curious mixture of 2- 
spin and 4-spin couplings. Unfortunately, we have no idea how to simulate efficiently 
such a model.0 But it would be worth investigating our conjecture that the idealized 
embedding algorithm works well; and if this conjecture is found to be true, then it 
would be worth investigating algorithms for simulating ( p. 61 ). 

One might try generalizing this algorithm to an SU{2) gauge theory, using the 
codimension-1 reflection 

U, -UlUlUl , (5.8) 

where {f//} is a random curvature-free gauge fleld (see Example 4 of Section |3J]) . 
Unfortunately, a global application of ( ^.8|) is not a symmetry of the Wilson Hamil- 
tonian, because 

Retr(f/if/2?73f/4) ^ Retr(f/|f/|f/]f/i) (5.9) 

for a non-Abelian group. Therefore, the induced Ising Hamiltonian would contain 
magnetic-fleld terms that contribute a hulk energy, and the idealized embedding al- 
gorithm would probably not work well. On the other hand, equality in (|5.9|) almost 
holds if the flelds f/i, U2, U3, U4 are close to the identity. Therefore, one might hope 
that for /5 ^ 1 (i.e. near the continuum limit), the idealized embedding algorithm 
corresponding to (|5.8| ) would work well in Landau gauge with the choice = I. 
Of course, to implement this idea in practice one would have to flnd an efficient al- 



gorithm for Landau gauge-fixing — a nontrivial problem [0, |6^, ^ for which the 
conventional algorithms also suffer from critical slowing-down. But the situation may 
not be completely hopeless. (We remark that other collective-mode algorithms, such 
as Fourier acceleration and multi-grid, may also need Landau gauge-fixing in order 
to perform well in the non-Abelian case ||67[| .) 

Let us conclude by mentioning several recent studies of Wolff-type embedding 
algorithms p, ^ |6^, ^ which complement our own. Hasenbusch and Meyer 
studied the three-dimensional XY model a.t (3 < jSc using the codimension-1 Wolff 
embedding with standard SW updates {Nhu = 1); they found Zint,£ ~ 0.46 and 
ZintM^ ^ 0.31. Janke studied the same model using single-cluster SW updates, 
and found Zint,s,cpu ~ 0.25 and Zint,M'^,cPU ~ 0. However, it should be noted that 
these exponents may well be due to critical slowing-down in the inner SW or ICSW 
subroutine; indeed, they are roughly of the same order of magnitude as the dynamic 

It is worth noting that the 4-spin couphngs Apei^ei^ei^si^ can be eUminated in favor of 2- 
spin couphngs, by introducing a new spin ep at the center of each plaquette and couphng this 
spin individually to e^^ , , ££3 , e^^ (analogously to the well-known "star-triangle transformation"). 
The problem is then to find an efficient simulation algorithm for the resulting pair-interacting Ising 
model, which is in general frustrated. 
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critical exponents of the SW and ICSW algorithms for the ordinary three-dimensional 
Ising model [|, 0, |l^, |2^.0 Therefore, it remains possible that the idealized Wolff 
algorithm for the three-dimensional XY model could have ^ ~ 0. In the near future 
we hope to study the idealized algorithm for this model. (Since in this case the SW 
algorithm simulates the induced Ising model reasonably well, albeit not perfectly, it 
will probably not be necessary to go beyond Nhu ~ 50.) 

Jansen and Wiese have very recently studied the two-dimensional CP^ and 



CP^ models, using the codimension-2 embedding z ^ Iiz (see Example 3 in Section 
3A\ above) and single-cluster SW updates. They also studied, for purposes of com- 
parison, a single-site Metropolis algorithm. They found that Zexp,M^ ~ 2 for both 
algorithms. Unfortunately, this fact alone does not constitute evidence for our con- 
jecture regarding the codimension-1 property: the slowing-down observed by Jansen 
and Wiese may well be due to the inability of the ICSW algorithm to update effi- 
ciently the frustrated Ising model that is induced by the codimension-2 embedding. 
It would be necessary to study the idealized embedding algorithm to draw a definitive 
conclusion regarding the merit of this embedding. 

Finally, a recent review talk of Wolff ||6^ contains many interesting ideas, and gives 
some preliminary (negative) results on embedding algorithms for the two-dimensional 
SU (3) principal chiral model. In particular, Wolff's results are consistent with our 
thesis that the codimension-1 and isometry properties must hold if the algorithm is 
to work well. However, as in the work by Jansen and Wiese, no definitive conclusion 
can be drawn except from a study of the idealized embedding algorithm. 
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A Some Topology and Geometry 
A.l Submanifolds and connectedness 

Let X be a connected finite-dimensional metrizableQ C°° manifold, and let be a 
subset of X (usually a submanifold or union of submanifolds). We wish to know 
whether or not X \ S* is connected. 

Our first theorem asserts that a manifold cannot be disconnected by deleting a 
subset of codimension > 2. To make this statement meaningful, we first have to 
define what we mean by "dimension" of an arbitrary subset S (Z X. The appropriate 
notion is provided by a branch of topology called dimension theory |7^, which 



assigns to each separable metric space 5* a dimension dimS' (= —1,0,1, 2,... or oo) 
having the following properties: 

(a) Dimension is a topological invariant, i.e. dimS'i = dim 5*2 if 5"! and S'2 are 
homeomor phic . 

(b) The empty set has dimension —1. 

(c) A nonempty open subset of an n-dimensional manifold (n > 0) has dimension 
n. 

(d) If 5*1 C 5*2, then dimS'i < dim 5*2. 

(e) If S is the union of countably many closed subsets Si, then dimS" = sup dimS'i. 

i 

(It is crucial here that the sets Si be closed. Indeed, it can be proven that any 
n-dimensional space can be decomposed as the union of n + 1 zero- dimensional 
subsets.) 

We then have the following fundamental result: 

Theorem A.l Let X be a connected finite- dimensional metrizahle manifold, and 
let S he a subset of X satisfying dimS" < dimX — 2. (In particular, this holds if S 
is the union of countably many closed submanifolds Si of codimension > 2.) Then 
X \ S is connected. 

■^^ For a connected finite-dimensional C°° manifold X, the following conditions are equivalent: X 
is metrizable; X is separable and metrizablc; X is second countable; X is Lindelof; X is paracom- 
pact; there exists a Riemannian metric on X. (For a proof, see vol. I, p. 271] and ||7^, Theorem 
VIII. 6. 5]. See also [|7^, p. 207].) The purpose of imposing this condition is to exclude patholog- 
ical examples such as the "long line" |7|, Example 45], which are locally Euclidean |7^, Problem 
3.12.18(b)] and even admit a differential structure ItI, p. 15], but are globally "too big". 
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Proof. See p. 48, Corollary 1] or |7|, p. 80, Theorem 1.8.19]. 



This can be rephrased as: if X \ S" is disconnected, then S must have codimension 
or 1. In particular, if X \ IJi Si is disconnected and the Si are closed, then at least 
one of the Si must have codimension or 1. 

Next we wish to study theorems going in the opposite direction, i.e. asserting that 
X \ 5* is disconnected. The naive converse of Theorem [A.l| is false: 



Example 1. Let X be the torus T^, and let S be the "slice" T^~^ x {a} for some 
a G T^. Then S has codimension 1, but X \ S" is connected. 

Example 2. Let X be the real projective space RP^~^ = S^~^ and let S be 
the equator in X. Then S has codimension 1, but X \ S* is connected. 

The key fact in both of these examples is that the manifold X is not simply 
connected. Indeed, if we assume that X is simply connected, then we can prove a 
converse to Theorem lA.ll: 



Theorem A. 2 Let X he a simply connected finite- dimensional metrizable C°° mani- 
fold, and let S be a closed codimension- 1 submanifold (without boundary) of X. Then 
X \ S is disconnected. 

Proof (explained to us by Sylvain Cappell). Fix a point p & S, and let U he a small 
open neighborhood of p. Let 7: [0, 1] f/ be a smooth curve that intersects S exactly 
once, doing so transversally at p. Then go = 7(0) and qi = 7(1) are points in U \ S 
"on opposite sides of S" . Now, if XyS* is connected (and hence path-connected), then 
there exists a smooth curve 7 in X \ S* running from qi to go- In that case a = 7 o 7 
is a loop in X, which intersects 5* exactly once. By a slight modification near go and 
gi, we can assume that a is smooth. 



Now recall the basic ideas of intersection theory mod 2 [[79| , Section 2.4]: Let M 
and X be finite-dimensional metrizable C°° manifolds, with M compact, and let S 
be a closed submanifold of X satisfying dimM -|- dimS" = dimX. If /: M — X is a 



smooth map that is transversal to S (see Section |A.2| for the precise definition), we 
define hif, S) to be the cardinality of /"^[S"] mod 2. A fundamental theorem states 
that if /o, fi are homotopic and are both transversal to S, then /2(/o, S) = /2(/i, S). 

To apply this theory, we let M = and f = a. By construction /2(a, S) = 1. 
On the other hand, since X is simply connected, a is homotopic to a constant map 
/?: S^ ^ X, where the constant can be chosen to be ^ 5"; so hiP, S) = 0. But this is 
a contradiction. ■ 



Remarks. 1. For 5* compact and connected, this proof can be found in p4 , 
Theorem 4.4.6]. 

2. A fancier way of phrasing this proof is to use the language of homology theory. 
There is a natural bilinear map Hi{X; Z2) x if„_i(X; Z2) —>■ Z2 (where n = dimX), 
called the "Poincare duality" or "intersection pairing" Theorem 65.1]: if a is a 
loop and S is an (n — l)-dimensional submanifold, then [a] ® [S] counts (mod 2) the 
number of times that a intersects 5*. For our loop a, this intersection number is 1, 
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so [a] is a nontrivial element of Hi{X; Z2). But this yields a contradiction if the first 
homology group of X mod 2 is trivial [i.e. Hi{X; Z2) = 0]. Now, since Hi{X; Z) is the 
quotient of the first homotopy group vri(X) by its commutator, simple connectedness 
implies the triviality of iJi(X;Z) and hence of Hi{X; Z2); but the latter condition 
is weaker. For example, the lens spaces L{n, k) are 3-dimensional manifolds having 



Hi{X; Z) = Zn 173, pp. 238-243]; so for n odd, we have Hi{X; Z2) = but tti{X) ^ 0. 
For an even more extreme example, let G be an arbitrary finite simple group, and 
let X be a compact polyhedron such that 7ri(X) = G |7^, Theorem 6.4.6]; then 
Hi{X;Z) = H^{X;Z2) = 0. 

If X has nontrivial first homology mod 2, then removing a single codimension-1 
submanifold may not disconnect X, as the two preceding examples show. Never- 
theless, by removing several codimension-1 submanifolds we can disconnect X. For 
simplicity, we restrict attention to the case where X is compact. 

Theorem A. 3 Let X be a compact connected n- dimensional metrizable C°° man- 
ifold, and let k = ranki7i(X; Z2). Let Si, . . . , Si be disjoint closed codimension-1 
submanifolds (without boundary) of X . Then X \ \Ji=i Si has at least I — k + 1 con- 
nected components. In particular, ifl>k + l, then X \ Si is disconnected. 



Proof. We imitate the proof of the Alexander duality theorem ^7\, Theorem 71.1]. 
Set A = \Ji=i Si, and let /' > / be the number of connected components of A. Then, 
by the Poincare duality theorem ||7^, Theorem 65.1], the [n — 1)'^* cohomology group 
mod 2 of A has rank i.e. H'^''^{A; Z2) ~ Ho{A; Z2) ~ Z!^. On the other hand, 
by Poincare duality we have H^~^{X; Z2) — Hi[X; Z2) — Z\. Now there is an exact 
sequence 

E'\X- Z2) ^ i/"(X, A- Z2) ^ H^'-^A; Z2) ^ i/"-^(X; Z2) (A.l) 

where i: A ^ X and j: (X, 0) {X,A) are inclusions, and 5* is the cohomology 
coboundary homomorphism |l77| . Theorem 43.1, compare Theorem 23.3]. So 5* induces 
an isomorphism 

kerj* ~ H''-\A;Z2)/i*[H''-^{X;Z2)] ~ Zl^ (A.2) 

where m>l' — k>l — k. 

Now let r(2) be the unique nonzero element of if"(X; Z2) — Z2 (also called an 
"orientation class for X over Z2")] then j*r(2) is a nonzero element of if"(X, A; Z2). 
Let k: X \ A ^ X be inclusion. Then by Poincare duality [T^, Theorems 67.1 and 



67.2] and Lefschetz duality [^, Theorem 70.6] we have the diagram 

H^iX,A;Z2) H^iX;Z2) 

\n>r(2) |nr(2) (A.3) 

Ho{X\A;Z2) ^ Ho{X;Z2) 
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where 0* is the Lefschetz duahty isomorphism, nr(2) is the Poincare duahty isomor- 
phism, and the diagram commutes up to sign. Therefore, 



ker j* ~ ker/c* . (A. 4) 

On the other hand, by exactness of the sequence 

Ho{X\A) Ho{X\A) ^ Ho{X) (A.5) 
||77| , exercise 71.1], we have 

kerA;, ~ HoiX\A) . (A.6) 
Combining these isomorphisms, we conclude that 

HoiX\A) ^ Z^, (A.7) 
i.e. X \A has m + l>/ — A;+l connected components. ■ 

Remarks. 1. In the foregoing proof it is not necessary that Si, . . . , Si be disjoint; it 
suffices that they be "hnearly independent mod 2", in the sense that rankif"'~^(lj'^^ 

S^■, Z2) > I. 

2. Is the bound / > A; + 1 best possible? We suspect that it is best possible under 
the hypothesis rankif"~^(lJi=i -Sj; Z2) > /, but that it is not best possible under 
the stronger hypothesis that Si, . . . , Si are disjoint. Indeed, if X is an orientable 
surface (= 2-dimensional manifold) of genus g, then k = rankifi(X; Z2) = 2g; and 
while 2g + 1 independent circles (= codimension-1 submanifolds) may be needed to 



disconnect X, only g + 1 disjoint circles are needed to disconnect X Exercise 
9.3.17 and Theorem 9.3.6]. 

3. If X is a connected manifold and S" is a connected closed codimension-1 subman- 
ifold, then X \ S is either connected or else has exactly two connected components 
Iff^ , Lemma 4.4.4]. Thus, each disjoint "cut" creates at most one new connected 
component. 

A. 2 Trans versality and genericity 

Let X and Y be finite-dimensional metrizable manifolds (1 < r < 00), let /: X — > 
y be a C" map, and let Z he a co dimension- submanifold of Y. We wish to show 
that "generically" the set f~^[Z] = {x: f{x) G Z} is a submanifold of codimension 
k m. X. The appropriate tool is a branch of differential topology called transversality 
theory \Ti, |T9|, 30|. We say that / is transversal to Z (denoted fffiZ) if, for every 



X E X, either 

(a) fix) i Z- or 

(b) {Dxf)(TxX) + Tf(^x)Z = Tf(^x)Y, i.e. the image of the tangent space T^X under 
the hnear map D^f contains a subspace of that is complementary to 
Tf(x)Z. 
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(For a nice intuitive discussion, with pictures, see [|79|, pp. 27 ff.].) Transversality is 



just the right condition we need to control the inverse image / 

Theorem A. 4 Let X,Y he finite- dimensional C"' manifolds (1 < r < oo), let Z 
be a codimension-k C" submanifold of Y , and let f: X ^ Y he a C' map that 
is transversal to Z. Then f~^[Z] is either empty or else a C^' submanifold (not 
necessarily connected) of codimension k. Moreover, if X is Lindeldf (resp. compact) 
and Z is closed, then f~\Z\ has only countably many (resp. finitely many) connected 
components. 



Proof. See Theorem 1.3.3], p. 28], [|1|, p. 24] or g, pp. 45-46]. 



Now the point is that transversality is generic: 

Theorem A. 5 LetX,Y be finite- dimensional metrizable manifolds (1 <r <oo), 
and let Z be a submanifold ofY. Then the set of maps 

Az = {f e C'{X,Yy. f IS transversal to Z} (A.8) 

contains a dense Gs subset of C^{X,Y) in the compact-open topology (also called 
the "weak topology"). In fact, if X is compact and Z is closed, then Az is a dense 
open subset of C^{X,Y). 



Proof. See [|7^, Theorem 3.2.1]; other references are ||8T|, pp. 25-26] and pp. 
46-50]. I 



Putting together Theorems [A.4| and |A.5| , we conclude that for "almost all" maps 



/, the inverse image f~^[Z] is either empty or else a submanifold of codimension k. 

We cannot exclude the possibility that f^^[Z] is empty: indeed, it is perfectly 
possible for the image f[X] to avoid completely the submanifold Z. However, the 
point is that in physical applications there will be a nonzero probability for f[X] to 
intersect Z. We could go on to formahze this idea: we would assume that a compact 
Lie group G acts transitively on Y, and we would seek to prove that under appropriate 
conditions the set 

{/ G C'iX,Yy. fiHaariig G G: {q o f)-'[Z] ^ 0}) > 0} (A.9) 

contains a dense Gs subset of C"'(X, F). But we are physicists, not mathematicians, 
and enough is enough. 

Let us summarize the upshot of all this topology for our physical application. 
Consider a a-model with x-space X and target space M, and let Z he a closed 
codimension-1 submanifold of M. We reason as follows: 

(a) If M is simply connected, then Theorem [A.2| implies that M\Z is disconnected. 



With "high probability" one expects the field cr to intersect more than one connected 
component of M \ Z. In this case the set X \ a^\Z\ is disconnected. 
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(b) For arbitrary M, Theorems |A.4 and A. 5 and the subsequent remarks imply 



that with "high probabihty" cr ^[Z] is a nonempty closed codimension-1 submanifold 
of X. If X is simply connected, Theorem |A.2| then implies that the set X \ (t~^[Z] is 



disconnected. 

(c) If neither M nor X is simply connected, then one uses Theorem |A.3| in place 



of Theorem |A.2| . One expects the probability of the desired event to be smaller, but 
still nonzero (uniformly for 1 <^ ^ < L). 

A. 3 Fixed points of isometries 

Next we discuss the fixed-point set for isometries of a Riemannian manifold. 

Theorem A. 6 Let M be a finite- dimensional Riemannian manifold, and let S be 
any set of isometries of M. Let F be the set of points of M which are left fixed by all 
elements of S. Then: 

(a) F is a closed set. 

(b) Each connected component Fi of F is a closed totally geodesi(^ submanifold of 
M. 

(c) For each Fi, there exists an open set Ui D Fi that intersects none of the other 
connected components of F. 

(d) If M is Lindeldf (resp. compact), there are at most countably many (resp. finitely 
many) Fi. 



Proof, (a) is trivial, since an isometry is necessarily continuous, (b) is |82|, p. 59, 
Theorem II. 5.1], and (c) is implicit in the proof given there. To prove (d), consider 
the open cover of M consisting of the {Ui} together with V = M \ F . Since M 
is Lindelof (resp. compact), there exists a countable (resp. finite) subcover; but by 
definition of Ui, this subcover must include all of the {Ui}, since otherwise it couldn't 
cover all of F. ■ 



Some partial converses to Theorem |A.6| (b) are mentioned in Further Remarks 1 
and 2 at the end of this section. 

The following lemma shows that isometries are completely determined locally (it 
is analogous to analytic continuation of holomorphic functions, but even stronger): 

Lemma A. 7 p. 62, Lemma LI 1.2] Let M be a connected finite- dimensional 
Riemannian manifold, and let (p and ip be isometries of M. Suppose that there exists 
a point p & M for which (p{p) = ipip) '^'^'^ {Df)p = {D'^)p- Then (f = ip. 

A submanifold C M is called totally geodesic at a point x ^ N ii each Af-geodesic which is 
tangent to A'^ at x lies in A^. The submanifold A^ is called totally geodesic if it is totally geodesic at 
each of its points. 
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It follows immediately that a fixed-point manifold of codimension can occur only 
in the case of the identity map: 

Theorem A. 8 Let M be a connected finite- dimensional Riemannian manifold, and 
let if he an isometry of M. If the set of fixed points of ip contains a nonempty open 
set (i.e. a suhmanifold of codimension 0), then is the identity map (and hence the 
fixed-point set is all of M). 

By a slightly more subtle argument, we can show that if the fixed-point manifold 
has codimension 1, then the map must be involutive: 

Theorem A. 9 Let M he a connected finite- dimensional Riemannian manifold, and 
let if he an isometry of M. If the set of fixed points of (f contains a suhmanifold of 
codimension 1, then if is involutive (i.e. is the identity map). 



Proof. Let N C Fix(y9) be a suhmanifold of M of codimension 1, and let p & N. 
Then in some small neighborhood U 3 p we can choose local coordinates (xi, . . . , x„) 
such that U n N is given by Xi = 0. In this basis, {D(p)p has the form 







V2 ••• Vn \ 








' J 


(A.IO) 



Since {Dip)p leaves invariant the metric tensor Gp [i.e. {Dip)pGp{Dip)p = Gp] and Gp 
is nondegenerate, it follows that det{Dip)p = ±1, i.e. vi = ±1. We now claim that 
{D^)l = I- 

(a) If f 1 = then we must have V2 = ■ ■ ■ = Vn = 0, because a matrix that leaves 
invariant a positive-definite quadratic form must be diagonalizable (over C), i.e. it 
must not have a nontrivial Jordan block. Then {Dip)p = I. 

(b) If = -1, then 



/ 



V 







{Vi + 1)V2 



I 



J 



I . 



(All) 



(Alternatively, we can argue that there exists a change of basis setting V2 = ■ ■ ■ = 

Vn = 0.) 

Thus, in either case, ip^ is an isometry satisfying (p^{p) = p and {Dip^)p = {D{p)p = 
I. Lemma A. 7 then implies that cp"^ is the identity map. [In case (a), (p is itself the 
identity map, while in case (b) it is not.] ■ 

Remark. This theorem seems to be well known to differential geometers, but we 
have been unable to find a published reference. Some vaguely related theorems, which 
may conceivably be of interest in future generalizations of the embedding method, 
are: 
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a) Let iS be a one-parameter group of isometries (or more generally, a connected 
abelian Lie group of isometries) of a finite-dimensional Riemannian manifold M. Then 
each connected component of Fix(iS) has even codimension |Q ||8^, p. 60, Theorem 
II.5.3]. 

b) Let T: M M he a smooth map which is periodic of odd period p (here 
p = min{g: T'^ = identity}). Then each connected component of Fix(4S) has even 
codimension MM. Note that T need not be an isometry; this theorem is purely 



algebraic and topological. For related material, see also ||85 |. 
Further Remarks. 1. Vanhecke and collaborators 



se, 37, 



89, 90, 91 have re- 



cently investigated reflections in Riemannian manifolds from a point of view opposite 
(but complementary) to ours. We start from an involutive isometry and seek to study 
its fixed-point manifold. They start, by contrast, from a sub manifold C M, and 
define the local reflection ifj^ about A^; this local reflection is automatically involu- 
tive, and they ask under what conditions it is a local isometry. In view of Theorem 
|A.6| (b) [or more precisely its local analogue], a necessary condition for ipjy tohe a. local 
isometry is that be totally geodesic. It is natural to ask whether this condition is 
sufficient. Vanhecke et al. prove the following interesting theorem ( []86| , Theorem 5.7] 
and [^, Corollaries 4(a) and 5]), which is somewhat reminiscent of Theorem T^: Let 
M be a Riemannian manifold. Then the following are equivalent: 

(a) For each geodesic curve (= totally geodesic 1-dimensional submanifold) N C 
M, the local reflection is a local isometry. 

(b) For each totally geodesic submanifold C M, the local reflection ip^ is a 
local isometry. 

(c) M is a space of constant curvature. 

2. Another partial converse to Theorem |A.6| (b) is due to Chen and Nagano 



Theorem 3.1]: In the manifold Q„ = SO{m + 2)/{S0{m) x ^0(2)) for m > 2, a 
complete connected submanifold A^ C Qm is totally geodesic if and only if it is a 
connected component of the fixed-point set of some finite set of involutive isometries 
of Qm- It would be interesting to know to which other symmetric spaces, if any, this 
result extends. Some totally geodesic submanifolds A^ C Qm of codimension m have 



been found by Nikic 

3. In Example 3 of Section |3.1| , we proved that in CP^'^^^ {N > 3) there are 
no isometries having a fixed-point manifold of codimension 1. Wolf ||9^ has shown 
much more: in are no closed totally geodesic submanifolds of 

codimension 1. Moreover, the same holds for quaternionic projective space QP""-'. In 
fact. Wolf Theorem 1] obtains a complete list of the totally geodesic submanifolds 
of ^^-1, /?P^-i, CP^-\ QP^-' and CayleyP\ 



A. 4 Codimension and frustration 



Here we prove some theorems mentioned in Section |2.3| , regarding the relations be- 
tween codimension and frustration. Theorem A. 10 and Corollary |A.11| state that 
non- frustration implies codimension 1. Corollary |A.12| shows further that in the case 
of an irreducible symmetric space, non-frustration occurs only in the codimension- 1 
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algorithm for the A^-vector model — i.e. the original Wolff algorithm. 

Theorem A. 10 Let M be a finite-dimensional Riemannian manifold with metric 
tensor g, and let T be an involutive isometry of M. Let E: M x M M. be a 
function satisfying E{(t, cr') = a + 6(i(cr, cr'Y + o{d{cr, cr'Y) as cr' cr , where a G M, 
6 > 0, and d is the geodesic distance on M . Define 

J{ct,(t') = E{(T,Ta') - E{(t,(t') . (A.12) 

Now let cr* G Fix(T). Suppose that there exists an integer m > 3 and a neighbor- 
hood U 3 a* such that for all cti, . . . , cr^, cr^+i = cti ^ U we have 

m 

l[J{a„a,+,) > 0. (A.13) 
Then the connected component o/Fix(T) containing cr* has codimension 1. 



Proof. Use a chart on U such that cr* = (by abuse of language we identify a point 
in U with its coordinates given by the chart) and g^v{cr*) = 5^^. Then the linear 
map {DT)„* is represented in these coordinates by an orthogonal matrix T satisfying 
T^ = I. Thus, T is symmetric with eigenvalues ±1, and the number of negative 
eigenvalues equals the codimension of the connected component of Fix(T) containing 
cr*. Finally, J is given in these coordinates by 



J(o-,o-') 



di(T,T(T'Y -d{(T,(T'f] + o[d{(T,(T'Y,d{a,T(T'y) (A.14a) 



r\2 



J\2 



2 /2\ 



+ o{cr'\a''') 



2bcT ■ (/ - T)cr' + o(<T^ (T 



i2\ 



(A. 14b) 
(A.14c) 



Now, if rank(/ — T) > 2 and m > 3, it is easy to choose cti, . . . , cr^ with arbitrarily 
small magnitudes such that CTj ■ (/ — T)crj+i > for i = 1, . . . , m — 1 and cr^ ■ 
(/ — T)cri < 0. [Using polar coordinates in some fixed two-dimensional subspace of 



Ran(/ — T), let CTj point at angle 6j = {j — l)0/(m — 1) for j = 1, 
TT < < 37r/2.] This proves the theorem. ■ 



, m, where 



Corollary A. 11 Let M , T , E and J be as in Theorem \A.1(\ . Suppose that for each 
cr* G Fix(T) there exists an integer m„* > 3 and a neighborhood U„* 3 cr* such that 
for all cri, . . . , cr^, crm+i = cri E U^* we have 

m 

X{J{CT,,CT,+,) > 0. (A15) 

i=i 

Then either 
(a) Fix(T) = 
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or else 

(h) every connected component o/Fix(T) has codimension 1. 



Proof. Immediate. I 



Corollary A. 12 Let M , T, E and J be as in Theorem \A.l(\ , and assume further that 



M is an irreducible compact Riemannian symmetric space of dimension n. Suppose 
that for each cr* G Fix(T) there exists an integer m„* > 3 and a neighborhood U^* 3 
cr* such that for all cri, . . . , cr^, cr^_,_i = cti G U^j* we have 

m 

l[Jiaj,aj+,) > 0. (A.16) 
j=i 

Then either 

(a) Fix(T) = 
or else 

(b) M is isometric to S"^, and under this isometry T is the codimension- 1 reflection 
cr — ^ Iicr. 



Proof. If Fix(T) ^ 0, then by Corollary |A.11| each connected component of Fix(T) 
must have codimension 1. By Theorem p.3| , M must be isometric to either 5""^ or 
i?P". But in Section lO] (Example 2), we classified the involutive isometrics of i?P": 
in particular, we found that for n > 2, every involutive isometry with Fix(T) ^ 
has at least one connected component of Fix(T) of codimension > 2. So M must 
be isometric to S^, and the theorem follows from our classification of the involutive 
isometrics of (Section |3.1|, Example 1). ■ 



B Involutive Isometries of SU{N) 

Let G be a compact simple Lie groupQ, with Lie algebra g. Then there is a unique (up 
to constant multiples) Riemannian metric on G that is invariant under both left and 
right translations; at the identity element e E G this metric is given by the negative 
of the Killing-Cartan form 

B{X,Y) = tr(adXadr) for X, F G g (B.l) 

We make the usual abuse of language, and call a Lie group "simple" if its Lie algebra is simple 
(i.e. has no nontrivial ideals). Of course G need not be simple in the group-theoretic sense: for 
example, SU{N) has a nontrivial (but discrete) center, isomorphic to Zn- 
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(see [0, pp. 57-58, Lemme 5]). We shall always consider G to be equipped with this 
unique bi-invariant Riemannian metric. 

For G = SU{N), the Lie algebra g = su{N) is the space of all traceless antiher- 
mitian N x N matrices, and the Killing-Cartan form is 

B{X,Y) = -const X Re tr(X^r) for X, F G su(A^) . (B.2) 



The isometry group of SU{N) has been determined by Cartan [95] (see also Wolf |^ 
sees. 2.4 and 4.1.2]): 

Theorem B.l / |g3| / The isometries of SU{N) are the following. ■ 

(a) A — > UAV 

(b) A — > UAW 
(a) A — > UAV 
(d) A — ^ UA^V 

for U,V & SU (N) . For N > 3, two isometries in this list are equal if and only if they 
belong to the same class (a), (b), (c) or (d) and in addition their determining matrices 
{U, V) and {U' , V) satisfy U' = CU , V = C-^V for some C eC = center of SU{N). 
For N = 2 the same statement holds provided that we consider only the classes (a) 
and (b). [For N = 2 the classes (c) and (d) are redundant because A = JA[—J) and 

A^ = JA^{-J), where J = zt^ = J ^ and ±J G SU{2).j 

Our next task is to classify these isometries modulo conjugacy: 

Theorem B.2 Two isometries of SU{N) are conjugate if and only if they belong 
to the same class (a), (b), (c) or (d) [for N = 2, (a) or (b)] and in addition their 
determining matrices {U, V) and {If, V) are related in one of the ways (i)-(iv) listed 
below: 

Class (a): 

(i) U' = CXUX-\ V = C-^Y-^VY 

(ii) u' = cxv^x~\ v = c-^y-^Wy 

(ill) U' = CXUX-\ V = C-^Y-^VY 
(iv) U' = CXV^X~\ V = C-^Y-^U^Y 

Class (b): 
(i) U' = CXUY, V = C'^XVY 
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(ii) U' = CXV^Y, V = C-^XWY 
(m) U' = CXUY, V = C-^XVY 
(iv) U' = CXV^Y, V = C-^XU^Y 

Class (c): 

(i) U' = CXUX'^, V = C-^Y^VY 

(ti) U' = CXV^X^, V = c-'^y^Wy 

(m) U' = CXUX^, V = C-^Y^VY 
(iv) U' = CXV^X^, V = C-^Y^U^Y 

Class (d): 

(i) u' = CXUY, V = C-^XVY 

(ii) U' = CXV^Y, V = C~^XWY 
(ill) U' = CXUY, V = C'^XVY 
(tv) U' = CXV^Y, V = C-^XU^Y 

In all cases X,Y e SU{N) and C e C = center of SU{N). 

Proof. Consider an isometry / of class (a), say f{A) = UAV, and let us compute 
g o f o g^^ with all possible isometrics g: 

(i) giA) = XAY =^ (gofo g-^){A) = {XUX-')A{Y-'VY) 

(ii) giA) = XA^Y =^ {gofo g-^){A) = iXV^X-^)A{Y-^WY) 

(iii) g{A) = XAY =^ {gofo g-^){A) = iXUX-')AiY-'VY) 

(iv) g{A) = XA^Y =^ {gofo g-^){A) = {XV^ X'^)A{Y~^U^Y) 

The claim then follows from Theorem p.l| . Analogous computations handle the cases 
when / is an isometry of class (b), (c) or (d). ■ 

Next we determine which isometrics are involutive: 

Theorem B.3 An isometry of SU{N) is involutive if and only if: 
Class (a): U^ = V-^ eC = center of SU{N). 
Class (h): UV-^ G C. 
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Class (c): Either (1) U = U^ andV = 

or (2) U = —U'^ and V = —V^ and N is even. 

Class (d): Either (1)U = V 

or (2) U = —V and N is even. 



Proof. We compute two successive applications of the given isometry, and demand 
(using the last part of Theorem p.l|) that it be equal to the identity map. 

Class (a): A — > UAV — > U{UAV)V. So we need U"^ = C , = C'^ with 
C eC. 

Class (h): A — > UAW — ^ U{UAWyV = {UV^)A{WV). So we need UV^ = 
C, WV = C^^ with C E C. The two conditions are equivalent; they amount to 
U = CV = VC, i.e. UV-[ e C. _ 

Class (c): A — > UAV — ^ UUAVV = {UU)A{VV). So we need UU = C, 
VV = with C e C. Equivalents we need U = CU^, V = C-^V^. But then 
U = CU^ = C{CU^f = CC^U; and since the central elements of SU{N) are 
multiples of the identity matrix, we have C = C'^; so we need = I. But this 
means that C = / if is odd, or C = ±J if is even. 

Class (d): A — > UA^V — > U{UA^VfV = {UV^)A{U_^V). So we need UV^ = 
C, U^V = C-^ with C eC. Equivalents we need U = CV,V = Q-^U . But for a 
central element of SU (N), C^^ = C, so we can write the second equation as = CU, 
hence V = CU. But then we have U = C^U, so C^ = I. It follows that C = / if 
is odd, or C = ±/ if is even. ■ 

Next we classify the involutive isometries modulo conjugacy: 

Theorem B.4 Every involutive isometry of SU{N) is conjugate to one of the fol- 
lowing: 

(ar^s) A — y IrAIs with r + s even, r < s and r < N/2. 
Here Ir = diag( -l, . . . , -1 , +1, . , +1 ). 

r times N — r times 

(hi) A — > At. 

(h2) [Only for N even] A — > e^^''^ A^ . 

(cl) A — ^A. 

(c2a) [Only for N even] A — > -JAJ. 

(c2h) [Only for N = ik, k integer] A — > e^'^'/^JAJ. 

(c2c) [Only for N = Ak + 2, k integer] A — > jAj. 

(dl) A — >A^. 
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(d2) [Only for N even] A — > -A^ . 

Moreover, none of the above isometries [for N = 2, none of the above isometries of 
classes (a) and (b)] are conjugate one to another. 

Proof. Class (a): The map A — > UAV is involutive iff [/^ = 1/"^ = C = 
^2mm/N It then follows that the eigenvalues of U (resp. V) are all ie'^*™'/^ (resp. 
_l_g-7rjm/Af^^ SO the matrices U and V can be diagonalized as U = X{e'^^"^^^ Ir)X~^, 
V = F-i(e-'^*'"/^/,)F with X,Y e SU{N) and < r, s < N. Note that m, r and s 
must all have the same parity modulo 2 (since det U = det V = +1), so in particular 
r + s must be even. By Theorem |B.2|(a)(i) it follows that the map A — > UAV is 



conjugate to A — > (e^^'"/^J,.)A(e-^^"'/^/,) = I,AJ,. By Theorem iKa)(ii) [or (iv)] 
we can take r < s. Trivially IjAIg = In-t^In-si so we can take r < N/2. Finally, it 
follows from Theorem p.2| (a) that there are no further conjugacies between members 
of this class. 

Class (b): The map A — ^ UAW is involutive iff f/ = e'^^^'^'^V. By Theorem 
R|(b)(i) with X = e-'^^^^'^V~^ and Y = I, this map A — > (,-i^i^/Ny jg 



jugate to A — f g27ri(m-2fc)/Af^t_ j£ jg odd, the integer k can be chosen so that 
m — 2k = (mod A^); if is even, k can be chosen so that m — 2k = or 1 
(mod A^). Finally, it follows from Theorem [B.2| (b) that if is even, these two alter- 
natives are not conjugate one to another. 

Class (cl): Consider the map A — > UAV with U = U"^ , V = V'^. We can 
write U = R + iS where R and S are real symmetric matrices. Expanding out 
UU^ = WU = /, we conclude that RS = SR and R^ + S'^ = I. It follows that we 
can simultaneously diagonalize R and S using a real rotation matrix L G SO{N): 

R = Ldiag(Ai,...,A7v)i^^ (B.3a) 
S = L diag(/ii, . . . , /iAr) (B.3b) 

with \i, fii real and \f + fif = 1. Hence 

U = Ldiag(e*^\...,e*^'^)L'^ 

= LDL'^ (B.4) 

with L e SO{N) and D a diagonal matrix in SU{N). Now let E = diag(±e*^i/^ 
e*^^/^, . . . , e*^^/^), with the sign chosen so that E E SU{N)] we have shown that 
U = XX^ where X = LE e SU{N). Similarly, V = Y^Y with Y G SU{N). It 
follows from Theorem |B.2| (c) (i) that the map A — > UAV is conjugate to A — > A. 



Class (c2): Consider the map A — > UAV with U = -U^ , V = -V^ and A^ 
even. We can write U = R + iS where R and S are real antisymmetric matrices. 
Expanding out UW = U^U = /, we conclude that RS = SR and R^ + S^ = -I. It 
follows from Lemma |B.5| below that R and S can be simultaneously brought into real 



Schur form by a real rotation matrix L G SO{N): 

R = Lblockdiag(AiJ, . . . , AAr/2J) (B.5a) 
S = Lblockdiag(/iiJ, . . . , /iAr/2J) iv"^ (B.5b) 



61 



with J ~ 1^ I g J ' -^i' A^j ^6^1 ^^"^ -^i + /^i = 1- Hence 

U = Lblockdiag(e'^iJ,...,e*^^/2j)L'^ 

= LDJL'^ (B.6) 

with L e 50(A^), D = diag(e*^^ e^^S . . . , e'Va, e^^^) ^ SU{N), and J = blockdiag(J, . 
Now let E = diag(e*^i/2^ e'''^/^, e'^^/^/^, g^^^/^/^); either E E SU{N) or else e'^^/^E e 
SU{N), according as deti? = ±1. We have then shown that either 

U = XJX^ with X = LE e SU{N) (B.7) 

or else 

U = e'^^'/^XJX^ with X = e'^'l^ LE e SU{N) . (B.8) 

And obviously the same holds for V. Thus, by Theorem p.2| (c) (i) we have shown that 
every isometry of class (c2) is conjugate to either A JAJ or ^ — > e^'^^/^ J AJ . 

We now show that these two isometries are not conjugate one to the other. We 
firstly prove that there do not exist X, F e SU{N) and C = cl E center of SU{N) 
such that J = CXJX'^e^'^^/^ and J = C^^Y JY^ . Indeed, if such matrices exist, then 
CV\-i/Nx and C~^/^Y are symplectic matrices and as such must have determinant 
+ 1 gl], pp. 347-349]. But dei{C^/'^e'''l^ X) = -c^l^ and det(C-V2y) = c'^/^ = 
c^/^, which is a contradiction. It easily follows, by using Theorem p.2| (c), that the 
two isometries are not conjugate one to another. 

It is convenient to make now a slightly different choice of representatives from 
the two conjugacy classes. Note that when when N = Ak {k integer), the isometry 
A — »• JAJ is conjugate to A —jAj [take X = il and F = / in Theorem |B . 2| ( c) ( i )] , 
while for iV = 4A; + 2 the isometry A — > e^'^^/^ JAJ is conjugate to A — > — JAJ [take 
X = e27ri(7v-2)/4Af y _ jj_ xhus, in either case we can choose A — JAJ as one 
representative, while for N = Ak (resp. = 4A; + 2) we take A e^'^^^^ jAj (resp. 
A JAJ) as the other. 

Finally, we prove that isometry (cl) is not conjugate to (c2a), (c2b) or (c2c). 
By Theorem [B.2| (c), such a conjugacy would imply the existence of X G SU{N) 
and C e C such that XJX^ = C. But then X' = C-^I^X e V{N) would satisfy 
X' JX'^ = I, or equivalently X'J = X' . Taking complex conjugates, we obtain 
X'J = X' (since J is real). Then X' J^ = JCJ = X\ which implies that = /. But 
in fact J^ = —I, so we have a contradiction. 

Class (d): The map A — > UA^V is involutive iff = ±f/ (with the — sign 
allowed only if N is even). By Theorem [B.2| (d)(i) with X = U^^ and y = /, this 
map A — > ±UA'^U is conjugate to A — > . Finally, it follows from Theorem 
B.2| (d) that if N is even, these two alternatives are not conjugate one to another [if 

xy = cgc, then xy = c' = c-V-C"^] • ■ 
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Lemma B.5 Let Ai, . . . ,Ak be commuting real antisymmetric NxN matrices. Then 
there exists a real rotation matrix L G SO{N) such that for each i = 1, . . . ,k, 



j^T^ j^ ^ f blockdiag(Ai,i J, . . . , Ai,Ar/2J) if N is even ,^ . 

* 1 blockdiag(Aj,iJ, . . . , Ai,(Ar_i)/2J, 0) if N is odd 

where the Xij are real numbers. 

Proof. Since Ai, . . . ,Ak are commuting anti-hermitian matrices, they have a com- 
mon eigenvector v G C^: 

AiV = iXiV for each i , (B.IO) 
where the Aj are real. Since the Ai are real, we have also 

AiV = —iXiV for each i . (B.ll) 

There are now two cases: 

(a) If the real and imaginary parts of v are linearly independent (i.e. v is not a 
multiple of a real vector), then w+ = [v + v)/\\v + and = [v — v)/i\\v — v\\ are 
perpendicular unit vectors in M^, and 

AiW+ = — AjW__ (B.12a) 
AiW- = XiW+ (B.12b) 

[If at least one of the Aj is nonzero, then V-Lv and hence ||f + v\\ = \\v — v\\, so 
the normalizations work out right. If all of the Aj are zero, then (|B.12[ ) is trivial.] 



Moreover, the subspace of orthogonal to w_} is invariant under each operator 

A. 

(b) If f is a multiple of a real vector w (which we take to be of unit norm), then 
all the Xi must be zero. Moreover, the subspace of orthogonal to w is invariant 
under each operator A^. 

We now continue the same process of reduction, working on the operators A^ 
restricted to or {w}'^. In this way we produce an orthonormal basis 

{tfi, . . . , wn} e consisting of all the vectors w± or w generated at each step; we 
order this basis so as to put first all pairs w±, then all individual vectors w. The 
columns of the desired matrix L are given by this basis. (If necessary, wi and W2 can 
be interchanged so as to guarantee detL = +1.) | 



Finally we determine the fixed-point manifolds for each of the involutive isometrics 
listed in Theorem IB. 4: 



Theorem B.6 The fixed-point manifolds F of the involutive isometrics listed in The- 



orem \B.4 are: 
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(ar,r) ^ — Ir^Ir- F — matrices of the form ^ ^ with B e U{r), C e U{N — r) 

and (det S) (det C) = 1. T/izs is the connected symmetric space S{U (r) xU{N — 
r)); it is a subgroup of SU{N) of codimension 2r{N — r). 

(ar,s) A — > IrAIs With r ^ s: F = 0. 

(hi) A — > Ah F is a disjoint union of components F,f. = {Ulr-W: U G SU{N)} for 
r even (0 <r < N). The manifold F^ is the symmetric space SU{N)/ S{U{r) x 
U{N — r)); it has codimension r^ + {N — r)^ — 1. 

(b2) [N even] A — > e^^'/^A^ : F is a disjoint union of components F,. = {e^'^^^UIr-W: U G 
SU{N)} for r odd (0 < r < N). The manifold F^ is isometric to the symmetric 
space SU{N)/S{U{r) x U{N - r)); it has codimension r^ + {N - rf - 1. 

(cl) A — > A: F consists of real matrices A G SO{N). This is a connected symmet- 
ric space; it is a subgroup of SU{N) of codimension |(iV^ + N — 2). 

(c2a) [N even] A — > -JAJ: F = USp{N/2) = Sp{N/2,C) n U{N). This is a 
connected symmetric space; it is a subgroup of SU{N) of codimension |(iV^ — 
N-2). 

(c2b) [N = 4:k, k integer] A — > e^''''^JAJ: F = 0. 
(c2c) [N ^Ak + 2, k integer] A — > JAJ: F = 0. 

(dl) A — > A^: F = {UU^: U G SU{N)}. The manifold F is the connected 
symmetric space SU{N) / SO{N); it has codimension \{N'^ + N). 

(d2) [N even] A — > —A^: F is a disjoint union of two components Fi and F2 
with Fi = {UJU^: U G SU{N)} and F2 = {e'^'^'/^UJU^: U G SU{N)}. 
The manifold Fi is the connected symmetric space SU{N)/ USp{N/2); it has 
codimension ^{N'^ — N). The manifold F2 is isometric to Fi. 

Proof. Class (ar-r): This is immediate. 

Class (a^^s)- If ^ = IrAIs with A invertible, then = AI^A'^, i.e. is similar 
to Ig. But this is of course false if r 7^ s. 

Class (bl): II A — A\ then A has eigenvalues ±1 and it can be diagonahzed in 
the form A — UlrU'^ with U G SU{N); moreover, r is even because A G SU{N). 
Conversely, every matrix of this form is in SU{N) and satisfies A — A^. Obviously 
these manifolds are disjoint, as Ir cannot be similar to if r 7^ s. 

Let us now consider U,V e SU{N) such that UlrW = VIrVl Then WVIr = 
IrWV, i.e. WV commutes with 7^ and thus belongs to S{U{r) x U{N-r)). Thus V = 
UK with K G S{U{r) x U{N-r)). Conversely, ifV^UK with K G S{U{r) x U{N- 
r)), then UlrU^ = VlrV"^ . Therefore, the manifold F^. is in one-to-one correspondence 
with the cosets SU{N)/S{U{r) x U{N - r)). 
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Class (h2): \i A = e^^^'^^A^, then Ae~'^^^^ is a hermitian unitary matrix. One can 
then apply the same reasoning used for class (bl), but r must be odd rather than 
even. 

Class (cl): This is immediate. 

Class (c2a): For A G SU{N) C U{N), clearly A = -JAJ iff A^JA = A^Aj = J, 
i.e. iff A e USp{N/2). In gT], pp. 347-349] it is proven that USp{N/2) C SU{N) and 
that USp{N/2) is connected. 

Classes (c2b) and (c2c): If A = cJAJ and A G U{N), then A'^JA = -cA^AJ = 
—cJ. Therefore a/^A is a symplectic matrix, and thus pp. 347-349] det(v^^y4) = 
+1. But for A E SU{N) with even, this means that (— c)^/^ = +1, which is a 
contradiction in the two cases (A^ = 4fc, c = e^'^*/^) and (A^ = 4A; + 2, c = 1). 

Class (dl): \i A = A^ G SU{N), then [see the proof of case (cl). Theorem 
A = UU^ with U G SU{Ny Let us now consider U,V e SUiN) such that 



UU^ = VV^. Then WV = WV, i.e. WV is real and thus belongs to SO{N). 
Thus V = UK with K G 50 (A^). Conversely, if V = f/fsT with K G 50 (A^), then 
UU'^ = VV"^ . Therefore, the manifold F is in one-to-one correspondence with the 

cosets SU{N)/SO{N). 

Class (d2): \iA = -A^, then [see the proof of case (c2). Theorem |B^ A = UJU'^ 



oi A = e^^^^^UJU^ with U G SU{N), and only one of these two cases holds for any 
given A. 

Let us now consider U,V e SU{N) such that UJU^ = VJV^. Then WVJ{WVf 
J, i.e. WV belongs to USp{N/2). Thus V = UK with K G USp{N/2), and the con- 
verse also holds. Therefore, the manifold Fi is in one-to-one correspondence with the 
cosets SU{N)/USp{N/2). Clearly is isometric to Fi. 
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